Gene Franean1_6358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6358 
Symbol 
ID5674674 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7718556 
End bp7720115 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content67% 
IMG OID641245207 
Productextracellular solute-binding protein 
Protein accessionYP_001510602 
Protein GI158318094 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCCCCA AAGTACGATT ACTCGTCACC GCTGCCGTCT GTTGCGCAAC GCTGGCCCTG 
GGCGCCTGCG GGGGAGGTGG CGACACCGGC CCGTCCTCGG GTGCCTCCGG CGAACCGGTT
GCCGGTGGCG AGGGAAGAAT TCTCACCCTC AGCGATCCGC GCAGCCTCGA CCCGGCGGCT
CTCGGCAACG CCTACGCAAC CACCGGTGTT GTGGGAAACG CGCTGTACGG AACGCTGATG
ACCGACCCGG GCGGCAAAAT ACGGTACTCG ATGGCCGAGT CCTTCCAGAC CACCGACGCC
GGGGCGACAT TCGAGCTGAA ACTGCGGTCG GGTCTGGTGT TCTCCGACGG AACCTCACTG
GACGCCGAAG CCGTGAAGTT CAACTGGGAC AGGCTCAAGA ACCCGGCCAC CGCCGCCATC
TCCCGGTCGG AGGCGTCGAT GATCGCCTCA TCCGACGTGG TCGATGACAC CACCTTGAAG
ATCACCATGG CCACGCCGGT GCCGAAGTAC GCCCAAGCCG TCCTCACCTC GTCCCTGAAC
TGGATCGCCT CGCCGACCGC CCTGGAGAAG GGGCCGCAGG CCTTCGACGC GAACCCGATC
GGCGCCGGGC CATTCACCCT GCGGAGCTGG ACACGCCAGG CCGCCATGGA ACTGGTCAAG
AACCCCCGCT ACTGGGACGC CCCCAAGCCC TACCTCGACC GTCTCACCCT CCGCGCGGCC
CTCGACAGCA GCCAGCGCTA CAACACGGTG CTCACCGGGG GCGCGGACGC GGCCGTCGAG
TCGAGCTGGG TCAACCTCGA CAAAGCCGAG CAGGCCGGCC TGCCGACGAA CCTCATACCG
ACCGGCGGCG GCATCTTCAT GGCGCTGAAC ACACGCAGGG CACCCTTCGA CGACGTCCGC
GCCCGCCAGG CCCTCGCCGC GGCAATCGAC AGGGACGCAC TCAACCAGGC TGTCTACAGC
GGGACCGGCG AGCCCGTCGA CACACTGTTC AGTAAGGACT CTCCTTACTA CTCGGACACG
CCGCTGGCGA CAACGGACCG TGCACGGGCG CAACGGCTCC TCGACGAGCT GGCCGCCGAC
GGCAAACCGC TGTCCTTCGT CTTCTCCAGC GTCCCCACGA CGGATGGCAA GGCGATCGCG
GAGAACATCC AGGCCCAGCT CAGCAGCTTC AAGAACGTCA CCGTCAAGAT CAAGACCATC
GAGGTCGCGG AGCTGGCCGC GCTGCGCACC ACCCACGACT TCGACGTGCT CGTCTCGTCG
GCCTTCTTCC GGGACCCCGA ACCGCGGCTG TGGACGACCT TCCACGGGAC CTCGGCGGCG
AACCTGCCCG GCATCAACGA CCCGGCACTC AACGAAAGCC TCGCGGCCGC GCGCACCGCG
ACTTCGGAGC CCGAGCGCGA ATCCGCCTAC GGGACACTGC AGGAACGGCT GGCAGAGCTG
ACCCCGGTGG TCTTCCTCGC GCAGGCGGCA CCCAGCGCCT TCTCGAGCAA GAACGTCGGC
GGACTCGTAC AGTACGGCCT CGGCTCACTT CAGCCCGAGG AACTCTGGAT TCAGCGCTAG
 
Protein sequence
MIPKVRLLVT AAVCCATLAL GACGGGGDTG PSSGASGEPV AGGEGRILTL SDPRSLDPAA 
LGNAYATTGV VGNALYGTLM TDPGGKIRYS MAESFQTTDA GATFELKLRS GLVFSDGTSL
DAEAVKFNWD RLKNPATAAI SRSEASMIAS SDVVDDTTLK ITMATPVPKY AQAVLTSSLN
WIASPTALEK GPQAFDANPI GAGPFTLRSW TRQAAMELVK NPRYWDAPKP YLDRLTLRAA
LDSSQRYNTV LTGGADAAVE SSWVNLDKAE QAGLPTNLIP TGGGIFMALN TRRAPFDDVR
ARQALAAAID RDALNQAVYS GTGEPVDTLF SKDSPYYSDT PLATTDRARA QRLLDELAAD
GKPLSFVFSS VPTTDGKAIA ENIQAQLSSF KNVTVKIKTI EVAELAALRT THDFDVLVSS
AFFRDPEPRL WTTFHGTSAA NLPGINDPAL NESLAAARTA TSEPERESAY GTLQERLAEL
TPVVFLAQAA PSAFSSKNVG GLVQYGLGSL QPEELWIQR