Gene Jann_4098 details

Gene Information

Locus tag: Jann_4098
Symbol:
ID: 3936587
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 4206920
End bp: 4208629
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 62%
IMG OID: 637906484
Product: extracellular solute-binding protein
Protein accession: YP_512040
Protein GI: 89056589
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
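
The start and end coordinates, the gene length, and the protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; both are assumptions, not statements made by the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4206920, 4208629)
print(length, protein_length(length))  # 1710 569
```

Under those assumptions both values agree with the Gene Length and Protein Length fields.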


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00991271
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.791743
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACTTAA CCCTTAGAAC AACCTCGGCC CTGGCGCTGA CCGTGGGGCT TCTGGCCACT 
CCGGCCATCG CCGATATGGA AGCGGCCATC GCGTTCCTGG ACGAGCATAT CGAGCATTCC
GCGCTGACCC GCGAAGAGCA GGAAGCCGAG ATGCAATGGT TCGTCGACGC GGCCCAACCC
TATCAGGGTA TGGAGATCCG CGTTGTGTCA GAGACCATCG CCACCCACGA ATATGAGGCC
AATGTGCTGG CCCCCATCTT CGAGGCGATC ACCGGCATCA GCGTCACCCA CGACCTGATC
GGCGAAGGCG ACGTGGTGGA GCGTCTGCAA ACGCAGATGC AGACCGGCGA AAACATCTAT
GACGCCTACG TCAACGACAG TGATCTGATC GGCACCCACT GGCGCTATCA GCAGGCCCGC
AACCTGACGG ACTGGATGGC GAATGAGGGC GCGGACGTTA CCAACCCCAA TCTGAACCTT
GACGACTTCA TCGGCCTGTC GTTCACGACG GGTCCCGATG GGCTGCTGTA CCAGCTGCCC
GACCAGCAGT TCGCGAACCT CTATTGGTTC CGCTACGACT GGTTCACCGA TCCAGAAATC
ATGGCCGATT TCCAGGAGCA ATACGGCTAT GAGCTGGGTG TTCCGGTCAA CTGGTCTGCC
TATGAGGATA TCGCGGAATT CTTCACCGGT CGTGAAATCG ACGGTGTCGA GGTCTTCGGT
CACATGGACT ACGGTCGTCG CGATCCGTCG CTGGGTTGGC GTTTCACCGA TGCGTGGATG
TCCATGGCTG GCATGGGCGA CATTGGCGAG CCGAACGGCC TGCCCGTCGA TGAGTGGGGC
ATTCGTGTGA ACGAAGACAG CCGTCCCGTC GGCTCTTGCG TCGCGCGTGG CGGTGCTACC
AACGGCCCGG CCGCAGTCTA CGCCATTGAG TCGTACACCA ACTGGCTGAC CAACTACGCA
CCGCCGGAAG CGGCTGGTAT GAACTTCTCT GAGGCGGGGC CACTGCCGTC GCAGGGTGTG
ATTGCGCAGC AGATGTTCTG GTACACGGCG TTTACCGCGT CGATGGTCGG CGAGGGTGCG
GAAGCGGTGC TGAACGACGA CGGGTCGCCC CGTTGGCGGA TGGCCCCCAG CCCGCACGGT
GTCTACTGGC GCGAAGGTCA GAAGATCGGC TACCAGGACG CGGGGTCCTG GACGTTGATG
CAGTCCACAC CCGTGGACCG GGCGCAGGCC GCATGGCTTT ATGCTCAGTT CGTGACGTCG
ATGACCGTGG ATGTCGAGAA GTCCCATGAG GGCCTCACGT TTATCCGCGA GTCCACGATC
CAGCACGAGA GCTTCACCGA GCGTGCGCCA AACCTGGGGG GTCTGATCGA GTTCTATCGC
TCGCCCGCCC GCACCCAGTG GTCGCCAACT GGTACGAACG TGCCTGATTA TCCACGTCTG
GCGCCGCTGT GGTGGCAGAA CATCGGCGAT GCATCGTCCG GTGCACTGAC CCCGCAAGAG
GCGCTCGATA ACCTTTGTGC GCAGCAAGAG GCCGTTCTGG CCCGTCTTGA GCGGGCAGGC
GTTCAGGGGG ATCTCGGTCC GCTTCTCAAC GATGAAAGCG ATCCGGAGTT CTGGCTGTCT
CAGGACGGTG CGCCCCAGGC CGCCCTTGAG AACGAGGACG AAGAGCCACA AACCGTCAGC
TATGACGAGC TGATCCAATC CTGGCAGTAA
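
The gene sequence is displayed in blocks of 10 bases separated by spaces, so the whitespace has to be stripped before any length or GC calculation. The following is a minimal sketch of that step; `clean_sequence` and `gc_content` are hypothetical helper names, and the example uses only the first displayed line, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


first_line = "ATGAACTTAA CCCTTAGAAC AACCTCGGCC CTGGCGCTGA CCGTGGGGCT TCTGGCCACT"
print(len(clean_sequence(first_line)))  # 60 bases on a full line
print(f"{gc_content(first_line):.0%}")  # GC fraction of this line only
```

Passing the complete gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.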
 
Protein sequence
MNLTLRTTSA LALTVGLLAT PAIADMEAAI AFLDEHIEHS ALTREEQEAE MQWFVDAAQP 
YQGMEIRVVS ETIATHEYEA NVLAPIFEAI TGISVTHDLI GEGDVVERLQ TQMQTGENIY
DAYVNDSDLI GTHWRYQQAR NLTDWMANEG ADVTNPNLNL DDFIGLSFTT GPDGLLYQLP
DQQFANLYWF RYDWFTDPEI MADFQEQYGY ELGVPVNWSA YEDIAEFFTG REIDGVEVFG
HMDYGRRDPS LGWRFTDAWM SMAGMGDIGE PNGLPVDEWG IRVNEDSRPV GSCVARGGAT
NGPAAVYAIE SYTNWLTNYA PPEAAGMNFS EAGPLPSQGV IAQQMFWYTA FTASMVGEGA
EAVLNDDGSP RWRMAPSPHG VYWREGQKIG YQDAGSWTLM QSTPVDRAQA AWLYAQFVTS
MTVDVEKSHE GLTFIRESTI QHESFTERAP NLGGLIEFYR SPARTQWSPT GTNVPDYPRL
APLWWQNIGD ASSGALTPQE ALDNLCAQQE AVLARLERAG VQGDLGPLLN DESDPEFWLS
QDGAPQAALE NEDEEPQTVS YDELIQSWQ
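
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the amino acid string NCBI lists for translation table 11, which matches the standard code for elongation codons. It does not handle the alternative start codons that table 11 allows, which is sufficient here because this gene starts with ATG. `translate` is a hypothetical helper, not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAACTTAA CCCTTAGAAC"))  # MNLTLR
```

The output matches the first six residues of the protein sequence above; translating the full gene sequence the same way can be used to check the remaining residues.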