Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2244 |
Symbol | |
ID | 5670643 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2681725 |
End bp | 2683155 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641241164 |
Product | ErfK/YbiS/YcfS/YnhG family protein |
Protein accession | YP_001506585 |
Protein GI | 158314077 |
COG category | [S] Function unknown |
COG ID | [COG1376] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.111887 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00847254 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTCCCGCT CGATGATGAG CTTGATACGC GCGCCCCGAG GCGCCGTTCC GTCCGGTTCC GGTTCCGGTT CCGATCAGGA TCGGGCCTCC GACCGGAGGG GGTCGCCCGC CACCCGCCGT CCGCTGGTCC GGCGCCGGTC GGAGGCCGTG CGGCCCGCCG GGATCATCCG GTCGCGGCTC GCCGCGGCCT CCGGCGTGCT CGCCGCGGCC GTGGTCCTGG CTTCCTGCTC CTCCGGGGGC GGGGGCACCG GCCCGGGCCA GAACGAGCCA CCGCGCCCGG CCGCGCCGAC CGTGACGGTG ACCCCGGCCG ACGGCGCTGC CGGCATCGCG CTCACCGAGT CGATCGTGGT GAAGAGCAAC GCGCCGCTGG CGTCGGTGAC CGTGGCCCGC GGCGCGAGCC CGACGGAGAA GACCGACCCC GGCACGCTGG AGGGGACCTT CTCCGCCGAC CGCCGGACCT GGACGTCCGC GGGTGGGTTG TTCTCCGACA CCCGCTACGA CATCCAGGCG GCCACCGCGC CGGCCCAGGG GCTGGACGGC ACCAGGAACA TCGCATCGAG CTTCACCACC GGCGTCCCGG ACAAGGCGTT CAAGGTGTCG TGGGAGCCGG TCGCCGGTCA GACCGTCGGG GTCGGCGCCC CGATCAGCCT GACCTTCAGC GCTCCGGTCA AGGACCGCGC GGCGGTGCAG AGCCGCCTGG CGGTGAACGC CGACCCGCCG GTCCTCGGCG CCTGGAACTG GATGTCGGAC CGGATGGCCG TGTGGCGCCC GCAGCAGTAC TGGGCGCCCG GGACGAAGGT GCACGTGGAG GCCAACCTCG CGGGCTTCGA CTCCGGCACC GGCTGGATCG GGGTCAAGGA CCGCTCGATG GACTTCGCGA TCGGGGCCGC CCAGATCAGC AAGGTCGACG CGGCCACCCA CGTGATGCAG GTGTTCCAGA ACGGCCAGCT CGTGCGGACC ATGCCGATCA GCGGTGGCAA GCCCGGGTTC CTGACCATGG AGGGCCCGCA CAACGTGCTG GGCAAGGCCC CGATGGTGAT CATGGACTCG GCGACGGTCG GCGTGCCGAA GGGCAACCCG GAGTACTACT ACGAAGAGGT GCAGTGGGCC GTCCACTACA CCAGCGGTGG GCAGTACGTG CACTCCGCTC CGTGGTCAGT GGCGTCGCAG GGCCGGGCGA ACGTCTCGCA CGGGTGCGTG AACGCCTCCC CGGCGGACGC GCAGTGGTTC TACAACTTCA GTCAGTTCGG CGACATCGTC GACATCAGCA ACACCGGTCG CCCGGCGGAT ACCCGGCAGC TCGGCAACGA GTGGTCCGTC CCGTGGGACA CCTGGAAGGC GGGCAGCGCG CTGCCCGTTG ACCAGCCCGC GGCCAGCGGT GCGCTGGCGG GCGCTGCGCC CGGCGCGGGG CTGCCCGCCG GTCGGACCTG A
|
Protein sequence | MSRSMMSLIR APRGAVPSGS GSGSDQDRAS DRRGSPATRR PLVRRRSEAV RPAGIIRSRL AAASGVLAAA VVLASCSSGG GGTGPGQNEP PRPAAPTVTV TPADGAAGIA LTESIVVKSN APLASVTVAR GASPTEKTDP GTLEGTFSAD RRTWTSAGGL FSDTRYDIQA ATAPAQGLDG TRNIASSFTT GVPDKAFKVS WEPVAGQTVG VGAPISLTFS APVKDRAAVQ SRLAVNADPP VLGAWNWMSD RMAVWRPQQY WAPGTKVHVE ANLAGFDSGT GWIGVKDRSM DFAIGAAQIS KVDAATHVMQ VFQNGQLVRT MPISGGKPGF LTMEGPHNVL GKAPMVIMDS ATVGVPKGNP EYYYEEVQWA VHYTSGGQYV HSAPWSVASQ GRANVSHGCV NASPADAQWF YNFSQFGDIV DISNTGRPAD TRQLGNEWSV PWDTWKAGSA LPVDQPAASG ALAGAAPGAG LPAGRT
|
| |