Gene Jann_3553 details


Gene Information

Locus tag: Jann_3553
Symbol:
ID: 3936028
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 3616521
End bp: 3617780
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 62%
IMG OID: 637905928
Product: extracellular solute-binding protein
Protein accession: YP_511495
Protein GI: 89056044
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
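
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon. Neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Start bp and End bp fields of this record.
START_BP = 3616521
END_BP = 3617780


def feature_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_length: int) -> int:
    """Residues in an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


gene_len = feature_length(START_BP, END_BP)
print(gene_len, protein_length(gene_len))  # prints: 1260 419
```

Under those assumptions the result agrees with the Gene Length (1260 bp) and Protein Length (419 aa) fields.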


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid HitchhikerNo
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGTA TTCTTATCAG TGCCAGCACC ATCGCCATGA TGGTCGCCGG CGCGGCGCAA 
GCCGACACGA CATTGAGGTT GGTTGAAGTG ATCACCAGCC CCGAGCGGAC GGCCGTTCTG
GAACAGATCG TGGCCGATTT TGAAGCCGCC AATGAGGGTG TGACGGTCGA GATTACCAGC
CTGCCCTGGG GCCAGGCGTT TGAGACGCTG GCGACCATGG TCGCGGGCGG CGACATCCCC
GATGTGGTGG AGATGCCCGA TACTTGGGCC GCGCTTTATG CGGGCTCGGA TCGTCTGGTG
GACCTGTCCG ACCATGTGGC CGGTTGGGAA GATGGCGCGA CACTGACCCA GCGGACCGTG
GACATGGGTA GCCAATCAGG CGCCTTCAAC ATGATCCCTT ACGGCTTCTA CCTGCGGGCG
ATGTTCTACA ACCGGGCCCT GCTGGAAGAG GCGGGCGTGG CCGAGCCGCC GCGCACGATG
GCGGACTTCG CGGCTGCCGC CGCAGCTGTG TCCGAATTGG ACGGTCGGTC CGGCTACTGC
CTGCGCGGCG GGCCGGGTGG CACCAACGGC TGGATCATGA TGGCAGCCGC GATGAACGGC
ACCAACGAGT TCTTCACCGA AGATGGCCAG TCGCGGATCA ACGAGCCGGG CTCTGTCGAA
GGGATCCAGT TCCTGCTGGA CATGTATCAG AACGGGAATG CGCCGCGTGA CAGCGTGAAC
TGGGGCTTTA ACGAGATCGT GGCGGGCTTC TATTCCGGCC AATGCGCGTT CCTTGACCAG
GACCCCGACG CGCTGATCGC GATTGCCGAG CGGATGGATG CGGACGACTT CGCCGTGATC
CCGATGCCCG TCGGCCCATC GGGTGAGGCA TTCCCGACCA TCGGATTTGC GGGTTGGTCG
ATCTTCAACA CGACCGAGCA CGAGGAAGAG GCCTTTGATC TGGTCGCGGC CCTGTCGTCG
CCCGAGGCAA ATTCCACCTG GGCGCAGCGG GTTGGCGTGA TCCCGATCCA CCAGGGTGCT
GATCAGGACC CTTATTTCCA GACCGACCAA TTCGCGGGCT GGTTTGAGAC CCTGAATGGC
GAGGAATATG TGCCCACGAT CATGCCCACC TACCTTGAGG AATGGGGCTA TTTCGCCAGC
ACAATCGTGG TTGAAACCAG CCAGGAAGCG CTTCTGGGAC AGATCACCGC GCAGGAATTG
GCCGACCAAT GGGCCGAGTA CCTGAACGAG GCGCAAGCCA ACTGGCAGGC CGCCCAGTGA
 
Protein sequence
MKRILISAST IAMMVAGAAQ ADTTLRLVEV ITSPERTAVL EQIVADFEAA NEGVTVEITS 
LPWGQAFETL ATMVAGGDIP DVVEMPDTWA ALYAGSDRLV DLSDHVAGWE DGATLTQRTV
DMGSQSGAFN MIPYGFYLRA MFYNRALLEE AGVAEPPRTM ADFAAAAAAV SELDGRSGYC
LRGGPGGTNG WIMMAAAMNG TNEFFTEDGQ SRINEPGSVE GIQFLLDMYQ NGNAPRDSVN
WGFNEIVAGF YSGQCAFLDQ DPDALIAIAE RMDADDFAVI PMPVGPSGEA FPTIGFAGWS
IFNTTEHEEE AFDLVAALSS PEANSTWAQR VGVIPIHQGA DQDPYFQTDQ FAGWFETLNG
EEYVPTIMPT YLEEWGYFAS TIVVETSQEA LLGQITAQEL ADQWAEYLNE AQANWQAAQ
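
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. Because this gene begins with ATG, no alternative-start handling is included. The helper names (`clean`, `translate`, `gc_content`) are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares these assignments with table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First display line of the gene sequence above.
first_line = "ATGAAACGTA TTCTTATCAG TGCCAGCACC ATCGCCATGA TGGTCGCCGG CGCGGCGCAA"
print(translate(first_line))  # prints: MKRILISASTIAMMVAGAAQ
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison with the protein sequence and the GC content field of this record.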