Gene Franean1_3291 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3291 
Symbol 
ID5671663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3898027 
End bp3899586 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content65% 
IMG OID641242180 
Productextracellular solute-binding protein 
Protein accessionYP_001507600 
Protein GI158315092 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCGTA GGAGACGTTT CCTGTTTGTG GCAGTAGCCA GTGGTCTTGC GGCGGTCCTC 
ACCGCCTGCG GTGGCTCCGG TTCCCCCTCG ACGTCATCTG CCTCAGGTGA ACCGGTCGCC
GGCGGTCACG GCCGGATCCT CATGTTGGGC GAGCCGCGCA GTCTGGATCC AGCGGCGCTC
GGCAACGCTT ATGCGATCAA TGCCGCCGTG GGTAACGCCT TATACGGAAC ATTGATGACC
GACGACGACA GCGGTAAGAT TCAGTTCTCC ATGGCGGAGT CGTTCACCAC CGCCGACAAC
GGCGCCACTT TCGAACTGAA ACTGCGGCCG GATCTGGTGT TCTCCGACGG GGCGCCGCTG
AATGCCGCTG CAGTGAAGTT CAACTGGGAC CGCCTGAAGA ATCCTGCTAA CGGCGCCACC
TCCCTCGCCC AGGCATCCGT GGCCGCCTCT ACCGAGGTGG TGGACGACCT CACGCTCAAG
GTCACGATGG TCACTCCCAT GCCCAGGTAC GCGGGCTCCG TCATCACCTC GTCGATGAAC
TGGATCGCCT CGCCCGCCGT GCTGGAGAAG GGAACGGAGG CCTTCGACAA GGCCCCGATC
GGTGCGGGTC CCTTCACTCT GAAGAGCTGG ACCCGCCAGG CCAGCATCGA GCTGGCGAAG
AACCCCAGGT ACTGGGACGC CCCCAAGCCC TATCTGGACA CGCTCACCCT GCGCACGCTT
GCCGACACCA ACCAGCGCTT CAACACGGTC CTTTCCGGCA CCGCGGACGC GGCCGCCGAG
TCCAGCTGGC AGAACTTCTC GAAGGCCGAG GAACAGGGCC TGGCCCTCGG CAGGCAGAAC
GTCAACGGTG GACTGTTCCT CACGATGAAC TCACGCCGGG CGCCGTTCGA CGATCCCCGC
GCCCGCCGGG CCATCGCCGC CGCGCTGGAC CTCGACGCTC TCAACCTGGC CGTCTACAAC
GGCCTGGGAA AGCCGGTCGA GACGCTGTTC ACCGAGGGCT CGCCCTTCTA CTCGAACATT
CCGCTCCGCA AGGTCGACAA GGCTGCCGCG CAGCGCCTGT TCGACGAGCT GGCCGCGGCG
GGCAAACCCG TCTCCTTCAC GTTCTCCGCG TTCCCCAGCA CAGAGAACCG GGCGATGGCG
GAGAACGTCC AGGCACAGCT CAGCACCTTT GACAACGTCA AGGTCAACAT CGAGCCCCTC
GACCAGTCCA GGCTCGGAGA GCTGTATTCG AAGCGGGACT TCGACATGGT CACCCTGTCC
TCCTTCTTCT ACGACCCCGA CCCGGTGCTG TCGACGGTCT TCGACGGGAG CTCGCCGTCC
AACCTGTCCG GCATCAACGA CCCGGAACTC AATGAGGCCC TGCAGGCCGG CCGCACCGCG
ACGAGCGACG AGGAGCGCGG GAAGGCCTAC GAGACCGTGC AGCGGCGGCT CGCGGACCAG
GTCCCGGTGG TCTTCATCAC GCGGGTGGCG CTGGGTGCCA TCGGCGGGCA GAACGTCGGC
GGCATCAGGC TCTACGGCAA CGGCTCGCTG CTGCCCGAGG AGCTGTGGAT CAGCAAGTAG
 
Protein sequence
MLRRRRFLFV AVASGLAAVL TACGGSGSPS TSSASGEPVA GGHGRILMLG EPRSLDPAAL 
GNAYAINAAV GNALYGTLMT DDDSGKIQFS MAESFTTADN GATFELKLRP DLVFSDGAPL
NAAAVKFNWD RLKNPANGAT SLAQASVAAS TEVVDDLTLK VTMVTPMPRY AGSVITSSMN
WIASPAVLEK GTEAFDKAPI GAGPFTLKSW TRQASIELAK NPRYWDAPKP YLDTLTLRTL
ADTNQRFNTV LSGTADAAAE SSWQNFSKAE EQGLALGRQN VNGGLFLTMN SRRAPFDDPR
ARRAIAAALD LDALNLAVYN GLGKPVETLF TEGSPFYSNI PLRKVDKAAA QRLFDELAAA
GKPVSFTFSA FPSTENRAMA ENVQAQLSTF DNVKVNIEPL DQSRLGELYS KRDFDMVTLS
SFFYDPDPVL STVFDGSSPS NLSGINDPEL NEALQAGRTA TSDEERGKAY ETVQRRLADQ
VPVVFITRVA LGAIGGQNVG GIRLYGNGSL LPEELWISK