Gene Jann_3838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_3838 
Symbol 
ID3936318 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp3932710 
End bp3933972 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content60% 
IMG OID637906216 
Productextracellular solute-binding protein 
Protein accessionYP_511780 
Protein GI89056329 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.754768 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATCCA TTTACAAGAC CGGTGCCGCA GCACTGGCAG CAGCGACCAT GCTTTCCGGC 
GTTGCCTATG CAGACGGCCA CGGCGTTTCT GGCCCCCTGC GCATCTTCTC GGACATGTCA
AATCCGGCTC CGCGCGCCGT CATGGAAGGC ATGGTTGAGC GGTTCACAGC CGAAAATCCC
GGCGTCGAGG TCGAACTGGT CATCACTGAC CGCGAGGCCT GGAAGACGCA GATCCGCAAC
GTGCTGCAAT CCGGCACCGC TGACATCGTC AACTGGTACG CCGGGAACCG CATGGCCCCC
TACGTAGATG CGGGCCTGTT CATGGACATC AGCGACCTCT GGGAAGAGGG CGGCCTGACC
GAAACGCTCG CCTCCACCCA AGGCTCCATG ACCATGGATG ACGGCATCTG GGGCGTTCCC
TACACATACT ATCAGTGGGG CGTATACTAC CGCGAGGATA TCTTCGCAGA GCTTGGCCTT
GAAGCGCCGA CGACCTGGGA AGAAGAACTG GCAAATTGCA CCGCGATCGT CGAATCCGGT
CGTGCCTGCT ACACCATCGG CACGCAGTTC CTGTGGACCG CTGGCGGCTG GTTCGACTAC
CTCAACATGC GCACCAATGG CTACGACTTC CACATGTCCC TGACAGCTGG CGAAGTAGAG
TGGACCGATC CCCGCGTCCG CGCCACGTTC GAGAACTGGC AGACCCTGAT CGATATGGGC
GGCTTTATCG CTGACCACCA GTCCTATTCT TGGCAGGAAG CGTTGCCCTT CATGGTCAAC
GGCGAAGCCA CGGCATATCT GATGGGGAAC TTCGCAGTGG CACCGCTGCG CGAAGCTGGC
CTCTCGGATG ATCAGCTGGG CTTCTACCAG TTCCCGCAGA TCGACGCCGC GATTGAGCCG
GGTGAAGATG CCCCGACCGA TACCTTCCAC ATCGCGGCTA ATGCCGAGAA CGTGGAAGCG
GCAGAGGCGT TCCTCCTGTT CGTGACCAGC GCCGAGAACC AGACGCTGAT CAACAACGGC
GACAACCTGG GTCAGCTGCC CGTCAACGCC AACTCCGGCG TCGATGATGA TGAATTCCTG
AATGCGGGCT TCGACATGCT GTCGAACAAC GCAGGCGGCG GCATCGCGCA GTTCTTTGAC
CGTGACGCAC CGGCTGAAAT GGCGCAGGTC GCCATGCAGG GCTTCCAGCA GTTCATGGTT
GATCCGTCTA CGCTCGACCA GGTTCTGCAG ATCCTAGAAG GCGCGCGTCA GCGGATTTAC
TAA
 
Protein sequence
MRSIYKTGAA ALAAATMLSG VAYADGHGVS GPLRIFSDMS NPAPRAVMEG MVERFTAENP 
GVEVELVITD REAWKTQIRN VLQSGTADIV NWYAGNRMAP YVDAGLFMDI SDLWEEGGLT
ETLASTQGSM TMDDGIWGVP YTYYQWGVYY REDIFAELGL EAPTTWEEEL ANCTAIVESG
RACYTIGTQF LWTAGGWFDY LNMRTNGYDF HMSLTAGEVE WTDPRVRATF ENWQTLIDMG
GFIADHQSYS WQEALPFMVN GEATAYLMGN FAVAPLREAG LSDDQLGFYQ FPQIDAAIEP
GEDAPTDTFH IAANAENVEA AEAFLLFVTS AENQTLINNG DNLGQLPVNA NSGVDDDEFL
NAGFDMLSNN AGGGIAQFFD RDAPAEMAQV AMQGFQQFMV DPSTLDQVLQ ILEGARQRIY