Gene Franean1_7039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7039 
Symbol 
ID5675350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8589549 
End bp8590835 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content63% 
IMG OID641245885 
Productextracellular solute-binding protein 
Protein accessionYP_001511276 
Protein GI158318768 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGTA AATCCTTTAC CGGAATATTA GCAGTGACCG CCGCCGTGGC GCTTCTGCTC 
GCCGGGTGCG GCCAGGCGGG GAACAGCACG ACGACCGCCG ACGGCAGGAT CCAGATACCC
ATGTGGACCC ACTCGGCCGG CAACCCCGCC GAGCTCGCGG TGTACAAGCA GATCATCTCC
GACTTCAACG AGTCGCAGGA CAGGTACGAG GTAGTACAAC AAGACTTTCC CCAGGTCACC
TATAACGACG CGATTGTCGC GGCCGCCGCA GCGGGGGACC TGCCGTGCCT CATCGACATG
GACGGACCGG TGATGCCGAA CTGGGCCTGG TCCGGCTACC TGCAGGAGCT CAATCTTCCC
AAGCAGCTCA CGGACAGCCT GCTGCCGACG GCGGTCGGCA CGTACAAGGG AAAGATCTAT
TCGGCCGGAT ACTGGGACGC CGCACTGGCA ATTTTCGCGC GCAAGTCGGT GCTTGACAAG
AACGATATCC GCATCCCCAC CGTGGACAGG CCGTGGACGA AGGACGAGTT CGACTCCGCG
CTCGCGACGC TGCAACAGGC CGGATACGAT ACTCCGCTCG ACATCGGCGC GGAGGACACC
GGCGAGTGGT GGTCGTACGC GTACTCCCCC ATGCTGCAGA GCTTCGGTGG CGACGAAATC
AATCGAGACA CCTACCGCAC GGCAGAGGGC GCTCTTAACG GGCCGGCTGC CGTGAACTTC
TTCACGTGGT TCCAGGATGC CTTCAAGAAA GGCTGGGCAA GCAACTCCGG CACGATCGGG
AACCAGGAGT TCGTCGACGA CAAGGTCGCG CTGAGCTACA CGGGCGTGTG GAATGCGCTC
GACTCGCTCG AAAAGATCGG CGACGACCTG CTCATCCTCC CGCCTCCGGA CTTCGGCCAG
GGGCCCAAGA TCGGCGGTGG CTCATGGCAG TGGGGCATCA CGGCCGGATG CGAGCAAGCC
GACGGCGCGC GCCAGTATCT GCGGTTCAGT TTCCAGGACA AGTACATCGC GCAGTTCGCG
GACAGCCAGA TCGTCATCCC CGCGACGGCC GGTGCCGAGG AGCTCTCGAA GTACTTCACG
GCCGATGGCG CTCTGCGCCC CTTCGTCGTG CTCTCCCAGA AGTTCGCCCT CGCGCGGCCC
GCGACCCCGG CCTATTCCGT GATCTCGTCG ATCTTCGAGA AGGCGACAAA GGACATCATG
AACGGCGCCG ATGTGAAATC CACACTCGGC AGTGCCGTGG AGGATATCGA CGAGAACATC
ACGGCCAACG ACAACTACGG CTCCTGA
 
Protein sequence
MKRKSFTGIL AVTAAVALLL AGCGQAGNST TTADGRIQIP MWTHSAGNPA ELAVYKQIIS 
DFNESQDRYE VVQQDFPQVT YNDAIVAAAA AGDLPCLIDM DGPVMPNWAW SGYLQELNLP
KQLTDSLLPT AVGTYKGKIY SAGYWDAALA IFARKSVLDK NDIRIPTVDR PWTKDEFDSA
LATLQQAGYD TPLDIGAEDT GEWWSYAYSP MLQSFGGDEI NRDTYRTAEG ALNGPAAVNF
FTWFQDAFKK GWASNSGTIG NQEFVDDKVA LSYTGVWNAL DSLEKIGDDL LILPPPDFGQ
GPKIGGGSWQ WGITAGCEQA DGARQYLRFS FQDKYIAQFA DSQIVIPATA GAEELSKYFT
ADGALRPFVV LSQKFALARP ATPAYSVISS IFEKATKDIM NGADVKSTLG SAVEDIDENI
TANDNYGS