Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3291 |
Symbol | |
ID | 5671663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3898027 |
End bp | 3899586 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641242180 |
Product | extracellular solute-binding protein |
Protein accession | YP_001507600 |
Protein GI | 158315092 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGCGTA GGAGACGTTT CCTGTTTGTG GCAGTAGCCA GTGGTCTTGC GGCGGTCCTC ACCGCCTGCG GTGGCTCCGG TTCCCCCTCG ACGTCATCTG CCTCAGGTGA ACCGGTCGCC GGCGGTCACG GCCGGATCCT CATGTTGGGC GAGCCGCGCA GTCTGGATCC AGCGGCGCTC GGCAACGCTT ATGCGATCAA TGCCGCCGTG GGTAACGCCT TATACGGAAC ATTGATGACC GACGACGACA GCGGTAAGAT TCAGTTCTCC ATGGCGGAGT CGTTCACCAC CGCCGACAAC GGCGCCACTT TCGAACTGAA ACTGCGGCCG GATCTGGTGT TCTCCGACGG GGCGCCGCTG AATGCCGCTG CAGTGAAGTT CAACTGGGAC CGCCTGAAGA ATCCTGCTAA CGGCGCCACC TCCCTCGCCC AGGCATCCGT GGCCGCCTCT ACCGAGGTGG TGGACGACCT CACGCTCAAG GTCACGATGG TCACTCCCAT GCCCAGGTAC GCGGGCTCCG TCATCACCTC GTCGATGAAC TGGATCGCCT CGCCCGCCGT GCTGGAGAAG GGAACGGAGG CCTTCGACAA GGCCCCGATC GGTGCGGGTC CCTTCACTCT GAAGAGCTGG ACCCGCCAGG CCAGCATCGA GCTGGCGAAG AACCCCAGGT ACTGGGACGC CCCCAAGCCC TATCTGGACA CGCTCACCCT GCGCACGCTT GCCGACACCA ACCAGCGCTT CAACACGGTC CTTTCCGGCA CCGCGGACGC GGCCGCCGAG TCCAGCTGGC AGAACTTCTC GAAGGCCGAG GAACAGGGCC TGGCCCTCGG CAGGCAGAAC GTCAACGGTG GACTGTTCCT CACGATGAAC TCACGCCGGG CGCCGTTCGA CGATCCCCGC GCCCGCCGGG CCATCGCCGC CGCGCTGGAC CTCGACGCTC TCAACCTGGC CGTCTACAAC GGCCTGGGAA AGCCGGTCGA GACGCTGTTC ACCGAGGGCT CGCCCTTCTA CTCGAACATT CCGCTCCGCA AGGTCGACAA GGCTGCCGCG CAGCGCCTGT TCGACGAGCT GGCCGCGGCG GGCAAACCCG TCTCCTTCAC GTTCTCCGCG TTCCCCAGCA CAGAGAACCG GGCGATGGCG GAGAACGTCC AGGCACAGCT CAGCACCTTT GACAACGTCA AGGTCAACAT CGAGCCCCTC GACCAGTCCA GGCTCGGAGA GCTGTATTCG AAGCGGGACT TCGACATGGT CACCCTGTCC TCCTTCTTCT ACGACCCCGA CCCGGTGCTG TCGACGGTCT TCGACGGGAG CTCGCCGTCC AACCTGTCCG GCATCAACGA CCCGGAACTC AATGAGGCCC TGCAGGCCGG CCGCACCGCG ACGAGCGACG AGGAGCGCGG GAAGGCCTAC GAGACCGTGC AGCGGCGGCT CGCGGACCAG GTCCCGGTGG TCTTCATCAC GCGGGTGGCG CTGGGTGCCA TCGGCGGGCA GAACGTCGGC GGCATCAGGC TCTACGGCAA CGGCTCGCTG CTGCCCGAGG AGCTGTGGAT CAGCAAGTAG
|
Protein sequence | MLRRRRFLFV AVASGLAAVL TACGGSGSPS TSSASGEPVA GGHGRILMLG EPRSLDPAAL GNAYAINAAV GNALYGTLMT DDDSGKIQFS MAESFTTADN GATFELKLRP DLVFSDGAPL NAAAVKFNWD RLKNPANGAT SLAQASVAAS TEVVDDLTLK VTMVTPMPRY AGSVITSSMN WIASPAVLEK GTEAFDKAPI GAGPFTLKSW TRQASIELAK NPRYWDAPKP YLDTLTLRTL ADTNQRFNTV LSGTADAAAE SSWQNFSKAE EQGLALGRQN VNGGLFLTMN SRRAPFDDPR ARRAIAAALD LDALNLAVYN GLGKPVETLF TEGSPFYSNI PLRKVDKAAA QRLFDELAAA GKPVSFTFSA FPSTENRAMA ENVQAQLSTF DNVKVNIEPL DQSRLGELYS KRDFDMVTLS SFFYDPDPVL STVFDGSSPS NLSGINDPEL NEALQAGRTA TSDEERGKAY ETVQRRLADQ VPVVFITRVA LGAIGGQNVG GIRLYGNGSL LPEELWISK
|
| |