Gene Jann_3935 details


Gene Information

Locus tag: Jann_3935
Symbol:
ID: 3936416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 4032565
End bp: 4033827
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 63%
IMG OID: 637906313
Product: extracellular solute-binding protein
Protein accession: YP_511877
Protein GI: 89056426
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
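
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; neither convention is stated on this page, so treat both as assumptions. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4032565
end_bp = 4033827

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1263
print(protein_length_aa)   # 420
```

Under these assumptions the result agrees with the listed Gene Length (1263 bp) and Protein Length (420 aa).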


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0774395
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.611638
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATGA TGAAAAAACT ATCTACGGGC GCAGCCGTGC TGGCCCTGTC AACGACGGCG 
ATCACGTCTG TTGCATCGGC ACAGGACGTG GAGGTTCTGC ACTGGTGGAC GTCCGGCGGC
GAAGCGGCTG CCCTGAACGT GTTGCGCGAA GATCTTGCCG GGGGTGGCAT CGGTTGGACC
GACATGCCAG TCGCTGGCGG CGGTGGCTCT GACGCCATGA CCGTCCTGCG CGCCCGCGTC
ACCGCAGGCG ACCCGCCCAC CGCCGTGCAG ATGCTTGGCT TCTCGATTCA GGACTGGGCC
GCCGAAGGCG CGCTGGCAGA CCTGAACGCG CTGGCGGAAG AGCAGAACTG GAATGAAGTG
GTGCCGGAAG CGCTGCAAGC GTTCTCCACC TACGAGGGCA ACTGGGTTGC CGCGCCGGTT
AACGTCCACT CCACCAACTG GGTCTGGGCC AACACCGCGC TGATGGAAGA GCTTGGTATC
GAGCAGCCCG GCACTTGGGA AGAGTTCGTC GCCGCGATGG AAACGGCAGC AGAGGCGGGC
TATACGCCGC TGGCCCACGG CGGTCAGGCT TGGCAGGACG CCACGATCTT CGACTCGATG
GTGATGGGCG TTGGTGGACC CGAGTTCTAT CAGGCCTCCA TGGTCGATCT GGATCCTGAA
GCCCTCGGCG GTGCGCAGAT GGTTGAAGCG TTTGACCGTA TGGCGACCCT GCGCGGCTTC
GTGGATGACA ACTTCTCCGG TCGTGACTGG AACCTGGCCT CTGCCATGGT GATCAATGGC
GAAGCGCTGT TCCAGATCAT GGGTGACTGG GCGAAGGGCG AATTCGTCAA TGCTGGCCTG
ACGGCTGGCG ACGAATTCCA GTGCTTCCGC GTGCCCGGCA CCGAAGGCAC CGTGACCTTC
AACTCCGACC AGTTCGCGAT GTTCGGCGTC GAGGACGAAG GCGATCAGGC GTCACAGGTT
GCCATGGCCT CTGCCGTGAT GTCGCCTGAA TTCCAGATCG CGTTCAACGT GGTGAAGGGC
TCTGCCCCTG CGCGCACCGA CATCGACGCA TCGTCCTTCG ACGCTTGTGG TCAGGCCGCC
ATGGCGGATC TGGCGGCAGC CGGTGAAAGC GGTGGCCTGT TCGGCTCCAT GGCCCACGGC
CACGCCAACC CGCCGTCGAT CCAGAACGCG ATGTACGACG TGATCACCGC CCACTTCAAC
GGTGAGTTCG ACTCAGCAAC CGCCGCCGAA GAGATGGTCA CCGCCGTTGA GCTTGCTCAA
TAA
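
A GC content figure such as the one in the Gene Information table can be recomputed from a sequence listed in this 10-base block layout. The following is a minimal sketch, not the database's own method: `clean_sequence` and `gc_percent` are hypothetical helper names, and the example input is only the first line of the gene sequence above, so its result is not the whole-gene value.

```python
def clean_sequence(raw: str) -> str:
    """Strip the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of ten bases).
first_line = "ATGACCATGA TGAAAAAACT ATCTACGGGC GCAGCCGTGC TGGCCCTGTC AACGACGGCG"

print(len(clean_sequence(first_line)))  # number of bases after removing spaces
print(gc_percent(first_line))           # GC percentage of this line only
```

To check the whole gene, pass the full sequence text (all lines joined) to `gc_percent` instead of `first_line`.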
 
Protein sequence
MTMMKKLSTG AAVLALSTTA ITSVASAQDV EVLHWWTSGG EAAALNVLRE DLAGGGIGWT 
DMPVAGGGGS DAMTVLRARV TAGDPPTAVQ MLGFSIQDWA AEGALADLNA LAEEQNWNEV
VPEALQAFST YEGNWVAAPV NVHSTNWVWA NTALMEELGI EQPGTWEEFV AAMETAAEAG
YTPLAHGGQA WQDATIFDSM VMGVGGPEFY QASMVDLDPE ALGGAQMVEA FDRMATLRGF
VDDNFSGRDW NLASAMVING EALFQIMGDW AKGEFVNAGL TAGDEFQCFR VPGTEGTVTF
NSDQFAMFGV EDEGDQASQV AMASAVMSPE FQIAFNVVKG SAPARTDIDA SSFDACGQAA
MADLAAAGES GGLFGSMAHG HANPPSIQNA MYDVITAHFN GEFDSATAAE EMVTAVELAQ
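
The relationship between the two listings can be illustrated by translating the start of the gene sequence. This is a sketch under stated assumptions: it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG). `CODON_TABLE` and `translate` are illustrative names, not part of any database tool.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(raw_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed above.
print(translate("ATGACCATGA TGAAAAAACT ATCTACGGGC"))  # MTMMKKLSTG
```

The output matches the first ten residues of the protein sequence listed above.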