Gene Jann_2075 details

Gene Information

Locus tag: Jann_2075
Symbol:
ID: 3934528
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 2082950
End bp: 2084251
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 62%
IMG OID: 637904431
Product: extracellular solute-binding protein
Protein accession: YP_510017
Protein GI: 89054566
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
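
The coordinate and length fields above are related by simple arithmetic, and a short check can confirm they agree. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2082950, 2084251)
print(length, protein_length_aa(length))  # prints: 1302 433
```

Both results match the Gene Length and Protein Length fields of the record.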


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.114781
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.235936
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCTTA AGAACACAAT CGGCGCAGGC CTTGCATTCG GCCTTTTGGC CGGAGCGGCT 
CAGGCGCAGA CCGAAATCGA ATGGTGGCAC GCGATGGGCG GCCAGCTGGG TGAGACCGTC
AACCAGATGG CGGAAAACTT CAACGCGAGC CAGGGTGACT ATGTCATCAC GCCCGTCTTC
AAAGGCACCT ATGAAGAGAC GCTGACCGCT GCCATCGCCG CCTTCCGCGC GGGCGAGCAG
CCCAACATCG TGCAGGTCTT CGATGCGGGT GCCGCGACGG TCATCGGCGC ACAGGGCGCG
ACCATCCCGG TGGAGCAGCT TTTGTCCGAG AACGGCGTTG ATTTCGACCG TGAGGATTAC
ATCTCGGGCG TGCGCAACTT CTATGCCGAC GCCGATGGCC AGATGATCGG CATGCCGTTC
AACTCCTCCA CGCCGATCAT GTACTACAAC GCCGATGCCC TGGAGGCCGC GGGTGTTGAG
CCGCCCGCCA CCTGGGAAGA ATTTGCCGAA GTCACAGCGC CCGCGCTGGC GGAAGCGGGC
TATGTGCCGC TGGCCCAGTC GCACCTGCCG TGGATCTTTA CCGAGAATTT CTTCTCGCGC
CACAACCTGC AGTTCGCGTC CAACGACAAT GGCTACACCG GCACCGATAC CGAGATCATG
GTCAACCACC CCGCCATCCG CGCGCATTTC ACCGCGCTGA CCGAATGGCA GGAGGCGGGC
TATTTTGAAT GGTACGGCAC CGGTTGGGCC GACAACCAGG ACCCCTTTGA AGCGGGCGAA
GTGGCCATGT GGCTCGGGTC TTCGGGCTCG TTCGGTGGCA TCGCCGACCG CGTTGACTTC
AACTTCTCCG CCGCCATGCT GCCTTATTGG GAAGCCGTGA CGACCGAGCC CACGCAGACC
TTCATCGGGG GCGCGGCTCT GTTCGCGATG TCCGGCTTTG ATGCGGAGCA GAATGAAGCC
ACAGCGGCCT TCTTCGACTT CCTCGACAGC GTCGATGCGC AGGTCATGTG GCACACGGAA
ACGGGCTATG TTCCGATCAC GACCGCCGCT TATGAAGCGA CGGCTGAGAC CGGTCACTAC
GACACGTTCC CGGCAGCCGA AGTGGGTGTC CAGCAGCTGC AGCTGCCCGC TGGTGAGTTC
ACCCGCGGCT ACCGCATGGG CTTCTACGTC CAGATCCGTG ACGTGATGAA CCGTGAGTAT
GGCCGCATCC TGACCGGTGA AACCTCCGTC GATGACGCGT TCGAGACGAT CGAAGCGGAG
GCCAACGAGC TTCTGTCGCG CTTCGCCCAG ACGCAGGGCT GA
 
Protein sequence
MNLKNTIGAG LAFGLLAGAA QAQTEIEWWH AMGGQLGETV NQMAENFNAS QGDYVITPVF 
KGTYEETLTA AIAAFRAGEQ PNIVQVFDAG AATVIGAQGA TIPVEQLLSE NGVDFDREDY
ISGVRNFYAD ADGQMIGMPF NSSTPIMYYN ADALEAAGVE PPATWEEFAE VTAPALAEAG
YVPLAQSHLP WIFTENFFSR HNLQFASNDN GYTGTDTEIM VNHPAIRAHF TALTEWQEAG
YFEWYGTGWA DNQDPFEAGE VAMWLGSSGS FGGIADRVDF NFSAAMLPYW EAVTTEPTQT
FIGGAALFAM SGFDAEQNEA TAAFFDFLDS VDAQVMWHTE TGYVPITTAA YEATAETGHY
DTFPAAEVGV QQLQLPAGEF TRGYRMGFYV QIRDVMNREY GRILTGETSV DDAFETIEAE
ANELLSRFAQ TQG
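
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout and translate the gene sequence so it can be compared with the listed protein. It is an illustration under stated assumptions: the helper names are hypothetical, the codon assignments are the standard ones (translation table 11 uses the same assignments for elongation), and alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq_text.split()).upper()


def translate(dna_text: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna_text)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna_text: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna_text)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
print(translate("ATGAACCTTA AGAACACAAT CGGCGCAGGC"))  # prints: MNLKNTIGAG
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence gives a quick consistency check of the record, and `gc_fraction` can be compared with the GC content field in the same way.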