Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3221 |
Symbol | |
ID | 5671597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3802106 |
End bp | 3803347 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641242115 |
Product | ABC-type branched-chain amino acid transport systems periplasmic component-like protein |
Protein accession | YP_001507535 |
Protein GI | 158315027 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCCAGA GACGATTAAG ATCATTTGTC GCGATGGCGG GTGCCGTTGC GGTGCTGGTG GCGGCTGCCG GCTGCGGCGG CTCGTCGTCG GACGGGAGTG GTGGTGATGC GGAGGCGCAG GGTGGCAAGA CCTACACGAT CGGGGTCTTA GCCGATATCA CGGGCCCGGC GGCGTCCGGG AACGAGACCA GCGTCGAGGG CGTCAAGGCG GGGACGTACT ACGCCGAGCG CGAGGGAATC AAGATCAAGT ACATCGTGGC CGACACGGCG ACGAATCCGA CGACCGCGCT CTCGGCCGCG CAGAAGCTGG TCACGCAGGA TCACGTGTTC GCCGTGATCG CCCACTCGGC GATCACGTTC TCCGCGGCTT CCTACCTCAC CGCTCAGAAG GTCCCGGTCA TCGGTTTCGC CCAGGACGGC CGAGAGTGGT TCACGTCCCC GAACATGTTC TCGATCACCG GGCCGACGAT CGACAAAGAA GTCACGACGA CGATGGGCGA GTTCTTCAAG TCGAAGGGGG CGACCAGCAT CGCCTCGATC GGTTACTCGG TCTCGCCCCA GTCGCAGGCC TCGGCGCTCG AGACGGCGGA GTCGGCCAGA CTCGCGGGTG TCAAAATAGG CTACGTCAAC GCGCAGCTCC CGTTCGGTAG CACCGATGTC GGTCCGACAG TGCTGGCCAT GAAGGAAGCC AAGATCGATT CCTTCTTCGC CGCGGTCGAC CCGAACACCG CCTTCGCTCT CATCTCCGGC CTGGAACAGC AGGGCGTGGA CATCAAGGTG GCGCTGCTGC CCACCGGCTA TGGCGGTGAC CTGGCGCAGG CTGGCCCGGG CGCGCGGCGA GCGGCTCAGG GTGTCTACTT CTCCCTCGGA TACCAGCCCG TCGAGATGCA GACAGCCGCT ACCAAGCAGT TCCAGAGCGA CCTGAAAGAA GCGGGGATCA CCGGGGCGCC GACGCTCGCG CATTACAACG GGTACATCTC GGTCGGTCTG CTCGTCCGGG CTCTCAAGGC GGCTGGCGCG GATCCGACGC CGGCATCGCT CACCAAGGCG CTCGCCGGAA TCCATGACTG GGACGGCCTC GGCCTCTACG GGGCCACGAA GTACGACCTC AGCCAGAAGA AGATCTCGAC CGGCGAGTGC CTGTTCATGA GCAGACTGGA CGGCAGCACG TTCAAGCCCG TCCCCGACGC TATCCCCATC TGCGGCGACC TGACCGACGA GAAGGTCACG CTCTCGTCCT GA
|
Protein sequence | MIQRRLRSFV AMAGAVAVLV AAAGCGGSSS DGSGGDAEAQ GGKTYTIGVL ADITGPAASG NETSVEGVKA GTYYAEREGI KIKYIVADTA TNPTTALSAA QKLVTQDHVF AVIAHSAITF SAASYLTAQK VPVIGFAQDG REWFTSPNMF SITGPTIDKE VTTTMGEFFK SKGATSIASI GYSVSPQSQA SALETAESAR LAGVKIGYVN AQLPFGSTDV GPTVLAMKEA KIDSFFAAVD PNTAFALISG LEQQGVDIKV ALLPTGYGGD LAQAGPGARR AAQGVYFSLG YQPVEMQTAA TKQFQSDLKE AGITGAPTLA HYNGYISVGL LVRALKAAGA DPTPASLTKA LAGIHDWDGL GLYGATKYDL SQKKISTGEC LFMSRLDGST FKPVPDAIPI CGDLTDEKVT LSS
|
| |