Gene Information |
Locus tag | Franean1_7002 |
Symbol | |
ID | 5675313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8534207 |
End bp | 8535751 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641245848 |
Product | hypothetical protein |
Protein accession | YP_001511239 |
Protein GI | 158318731 |
COG category | |
COG ID | |
TIGRFAM ID | |
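The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that `Start bp` and `End bp` are 1-based inclusive positions and that the protein length excludes the terminal stop codon; neither convention is stated in the record, and the function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for illustration.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Franean1_7002.
length = gene_length_bp(8534207, 8535751)
print(length)                     # 1545
print(protein_length_aa(length))  # 514
```

Under these assumptions the result matches the listed `Gene Length` (1545 bp) and `Protein Length` (514 aa).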
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.618001 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGCTCAC TGATCCTGCT GATCATCGCA ATCTGGCTGG CCTTCTTTTT CTTCACCGTG CCCGGGCCGT TCATCGCCGG CGCCGCGGCC CTGTACGTCG CCGGGGCACT GGTCGTCGCC TATTTCCAGG AATTCGGCCG GGCGATGGGG CTGGGTCCGA ACCCCGCCGC CGCCGCACCG GAACAACCGC CGCCGCGGCG CACCGAGGAC GGGAAGGAAC CGGCCTACCG GCAGTACCTG TTCGGACAGG CCCGTCACGA TCTACGGCAT GCGCACGGTC TCGTCTGGCC GCGGCTGGCC GCGCTCGGCC GGAATCACCG GCGGTGGGTG AAGAACACCT TCTTCGGCTA CGGGGCGATG GACTGGCACT GGCCGGTGGG CGTCGTCCTG ATGGTCGGCC TGTTCGCCGG GACGATCCTG GGCCTTGCCG TCATCTCGCT GGTGGCGACG GCCCAGGGCG TGGTGCTGCT GGCCGTCTTT CTGCTGGCGT TCCTCGGGAT CTACCTGCTG CGCGGCATCG ACACCGTCCT GCTGTGGATC CGCGGGGTGC GCATCACCTG CCCGTCTTGC TACCGGCGCG GCTTCTACCC GTCCTACGAG TGCCGGAACT GCACCGTCCG CCACCACGAC GTGCGGCCGG GCAGGTACGG CGTCGTCCGG CGCGTCTGCG CGTGCGGGGA GCGCCTGCCG ACCCTGCTGC TGTTGGGCAG CCACCGGATG AACGCCTTCT GCGCGCACTG CGAGGCTCCA CTGGCCGAGA GCGTGGGCAC CGCCGCTGAG GTGGTGCTCC CGGTGTTCGG CGCGGCCGGC GCCGGCAAGA CCCGGCTGAT GATCGTCATC ATGATGGCGG TCGAGGCGAT CGCCGGACGC AGCGGCGCCA CCCTCGCCCT GGCCGACGAG GACACCCGGA AATGGGACGC CCAGGCCCGC CGCGAGCTCA TCAGGTCGGA CAAGGTCGCG AAGACCGGAA TCCGGCTTCC CCGCGCCTAC TCGCTTTACG TCGAGCCGCG GCGGGGCGGC CGGCGGCTCG TCCACGTCTT CGACCCGGCC GGCGAGTACT TCAACGAATC CGACCGCCTG CAGGAGCTGC AGTTCCTCAC CCTCGCGCGC ACCTTCCTGT TCGTCGTCGA CCCCCTGTCG GTAGACGCGC TGTGGGCCCG GCTCGACCGC GCCGACCAGA ACCGGTACAG CGGCGTCCGG GCCAGGCGCG AACCCGAGTT CGTCTTCGCG CAGACGGTCC AGAACCTCGA GGCGATGGGC GTGCGGACGA AGAAGGCCCG GCTGGTCGTG GTCGTGAGCA AACGCGACCT CGTCAGTCGG ATCCTGATCG AGGACGGCGT GGAGGACGGC GAGGAGGCCC TCGTGCGCTG GCTCGACGAG AACCTGCACC AGGGAAACAT GCTGCGGTCC ATGCGGCACG CGTTCGGCGA GGTTCAGTTC TTTCTCACCA CCTCCATCAT TTCTGACGAC AGCCGGGTCG ACGACAGCAT CGAGAAACTG ACGTCCTGGA CGCTCGCGCG GCAGGGCCTG CGGCTTTCGG GGTGA
Protein sequence | MGSLILLIIA IWLAFFFFTV PGPFIAGAAA LYVAGALVVA YFQEFGRAMG LGPNPAAAAP EQPPPRRTED GKEPAYRQYL FGQARHDLRH AHGLVWPRLA ALGRNHRRWV KNTFFGYGAM DWHWPVGVVL MVGLFAGTIL GLAVISLVAT AQGVVLLAVF LLAFLGIYLL RGIDTVLLWI RGVRITCPSC YRRGFYPSYE CRNCTVRHHD VRPGRYGVVR RVCACGERLP TLLLLGSHRM NAFCAHCEAP LAESVGTAAE VVLPVFGAAG AGKTRLMIVI MMAVEAIAGR SGATLALADE DTRKWDAQAR RELIRSDKVA KTGIRLPRAY SLYVEPRRGG RRLVHVFDPA GEYFNESDRL QELQFLTLAR TFLFVVDPLS VDALWARLDR ADQNRYSGVR ARREPEFVFA QTVQNLEAMG VRTKKARLVV VVSKRDLVSR ILIEDGVEDG EEALVRWLDE NLHQGNMLRS MRHAFGEVQF FLTTSIISDD SRVDDSIEKL TSWTLARQGL RLSG
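Both sequences are displayed in space-separated blocks of 10 characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that clean-up plus a GC-fraction calculation, which could be used to compare the full gene sequence against the listed `GC content`. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace between 10-character blocks and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two display blocks of the gene sequence, used as a short example.
example = "ATGGGCTCAC TGATCCTGCT"
seq = clean_sequence(example)
print(len(seq))
print(gc_fraction(seq))
```

Passing the complete `Gene sequence` string through `clean_sequence` should give a sequence whose length equals the listed gene length, which is a quick way to confirm that nothing was lost when copying.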