Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4563 |
Symbol | |
ID | 5672910 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5443820 |
End bp | 5445094 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641243426 |
Product | putative lipoprotein |
Protein accession | YP_001508842 |
Protein GI | 158316334 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAAGA CGGAACACAG GCTCGCCACC CGGGCGACAC GCCGCGGGGC GGTGGCGGCC ATCGCCGCGG CAACGCTGAT CGGTGCGGCG GCCTGCGGCA GCGACAGCGG GTCGGACGGC GGCGGCTCCG CGCCGACCAC CGCCAGCACC GCCATTCCCT CCGCCCTCGC GAGCTACTTC CCGGGCAAGG CGGCGAGCGG CGACCCGGTG AAGATCGGGT TACTCAACAC CGAGGGCGGT CCCGCGCTGT CCGATCCCGA CATCGGCGAC GCCGCCGTCG CGGCGGCCGA GTACGCCAAC GCCCATCTGG GGGGCATCGG CGGCCACCCG ATCGAGATCG ACCGTTGCGG GATCCTCGAG GACGTCGCCT CGTCGGTCAA GTGCGCGAAC CAGATGGTCG AGGACAAGGT CGCGGCCGTC GTGGTCACCA CGACTGGCTT CGGCGAGTCC ATCGTGCCGA TCATCACCAA GGCCGGCATC CCGTACGTCT CGGTGGCCGG GGCGAGCAGC GGCGAGTTCA CCAGCAAGAA CGCCTACATG TGGACGGGCG GCTTCGCCGC CACCCTTTCC GCTATGGCCA CGTATTCGGC CGAGCAGGGC TACAAGAAGG TCGCCGCCTT CGTGACCGAC GTGCCGGCCG CGACAGGCGG GGCGGCCAAG ATCGGCGTCC CCGCGTTCAA GGCGGCCGGG ATCGAGTTCA CCGTCGTACC GGTCACGCCC GGCACGGCGG ACGCGACGCC CCAGGTGACC AGTGGCCTCG CCGGCAAGCC GGACGCCGCG ATCCTGATCT TCAATTCCAC CGGCTGCACG ACGGCGCTGA AGGCGCTCAG CGTGGTCGAC CCGACCATTC CCAAGCTGGG CATCACCGGC TGCCTCGACC CGGCGACGGT CGATGCCGTG GGCGGCGCGC TAGAGGGCGC CAAGGTCTTC GGCGTCTCCT CGATCGCGAC CGACGATCCC GAGGCACAGC TCTACCGGAC CGTGATGGCC AAGTACGCGC CCGACGCGTC GATCTCGGGC TACACGCCGG TCGGCTACCA GGGAATGCTC GGCCTGATCC GGGCCACCGC CAAGCTGACC GGCTCCGTCA CCAGCAGCTC GATCCTCGCC GCCATCGCGC AGGCCAAGGA CGTCCCGCTG CCGGCCGGTG CAGGGGTCAC CTTCACCTGC GACGGCAAGC AGCTTCCCGG CCTGACCACG GTGTGCTCCG CAGGAGAGAT CGTCCTGACG GTGAAGGACG GCGTGGGCAC GCAGCCCGAA ACCATCGACA ACTGA
|
Protein sequence | MIKTEHRLAT RATRRGAVAA IAAATLIGAA ACGSDSGSDG GGSAPTTAST AIPSALASYF PGKAASGDPV KIGLLNTEGG PALSDPDIGD AAVAAAEYAN AHLGGIGGHP IEIDRCGILE DVASSVKCAN QMVEDKVAAV VVTTTGFGES IVPIITKAGI PYVSVAGASS GEFTSKNAYM WTGGFAATLS AMATYSAEQG YKKVAAFVTD VPAATGGAAK IGVPAFKAAG IEFTVVPVTP GTADATPQVT SGLAGKPDAA ILIFNSTGCT TALKALSVVD PTIPKLGITG CLDPATVDAV GGALEGAKVF GVSSIATDDP EAQLYRTVMA KYAPDASISG YTPVGYQGML GLIRATAKLT GSVTSSSILA AIAQAKDVPL PAGAGVTFTC DGKQLPGLTT VCSAGEIVLT VKDGVGTQPE TIDN
|
| |