Gene Jann_3701 details


Gene Information

Locus tag: Jann_3701
Symbol:
ID: 3936180
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 3783891
End bp: 3785234
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 64%
IMG OID: 637906078
Product: extracellular solute-binding protein
Protein accession: YP_511643
Protein GI: 89056192
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
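
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # Assumption: the final codon is a stop codon and encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(3783891, 3785234)
print(length)                     # 1344
print(protein_length_aa(length))  # 447
```

Both values match the Gene Length and Protein Length fields of this record.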


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000375637
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.401469
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGGCC CTGGGCTGCA TCTAGACGAC TTGAAAAACA ACGGGCCAAA GCGGGGCAAT 
CCCACGGACC GCCCGAGAAA AACGAACGAA AACCAACGGG AGAGATATCC GATGTTTGTG
AAACGACTTG CCCTTGCGGG CGCAGTCTCC GTGGCGGCAA CAGCGGCCTA TGCCGATGGC
CACGCGTCCT GCGACTGGGA AAACACCACG GAAGTGACGA TGCTGTCCGC GGCATTTGAA
GCCTGGATGG CCGTGACCGA TGGCATGGCC GCCTGTGGCA ACTTCGCACC CGAGCTGGAT
CAGGAATTCC GCACCAAACA GCCCGAAGGC TTCGCGGCCT CGCCCTCGCT CTATGCGCTA
GGCGGTGTGT CCAACGGCAC GATCACCCCC CTGCTGAACC AGGACACGAT CCGCCCGCTG
GATGACCTGA TCGCGGCCCA CGGCGACAGC CTGACACCCA ACCAGCTGAT CCAGATCGAC
GGGCAGACCA TGGCCGTGGC GATGATGGTC AACGTCCAGC ACCTGATGTA CCGCGAGGAT
ATTCTGGCCG ATCTGGGCAT CGAGACGCCG TCGACCTGGG AAGAGGTCCT GACGGCCGCC
GAGACGATCC AGGAAGCTGG CGTCGTGGAT TACCCCCTGG GGGGCACCAT GAAGGCCGGT
TGGAACATCG CGCAGGAATT CGTGAACATG TTCAGCGGCT TCGGCGGTGA TTTCGTCAAT
GACGACAACA CCCCCAACGT CAACACAGAG GCCGGTCTTG CCTCCCTCGA CATGATGATG
CGCCTGACGG AATACATGGA TCCTGAATTC CTCGTGTCCG ACAGCACCTA TGTGCAGCAG
CAGTTCCAGC AGGGCTCCAT CGCGATGGCG AACCTCTGGG CGTCCCGTGG CGGTGCCATG
GATGATGAGG CGGAAAGCCA GGTCGTGGGT CTGGTGGCCT CTGCCGCAGC GCCGCTGGCC
GTGGATGGCG GCCCACCCGC CACCACCCTG TGGTGGGACG GTATCGTCGT GGCGGCCAAC
ATCACCGATG AGGAGGCCGA TAACGCGTTC CGTCTGGCGA TGGAAGGCAT CGACAGCGAG
ACTGTGGCCG CCGCACCGGA CGCTGCGATC TGGCTGATCG AAGGGTATGA GCCCAACGAC
ATGGCCGCCG GTGCCATCGC CTCGGCCACT GCCAACCCCG CGCCGCCGGC CTACCCGTCC
ACGACGCAGA TGGGCTTGAT GCACACGGCG CTTGGCAATG AGCTGCCCGC CTTCTTCACC
GGCGATGTCT CCGCGGAAGA AGCGCTGGCC GCTGTTGAGG CCGCCTATAC GGTCGCCGCA
CAGGAAGCGG GTCTGCTGGA GTAA
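
The GC content field can be recomputed from the gene sequence as the fraction of G and C bases. The sketch below is a minimal illustration, assuming the sequence contains only unambiguous A, C, G, T characters plus the whitespace used for the 10-base grouping; `gc_content` is a hypothetical helper name.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a DNA string; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Usage: paste the full gene sequence in place of this first line of it.
first_line = "ATGGGCGGCC CTGGGCTGCA TCTAGACGAC TTGAAAAACA ACGGGCCAAA GCGGGGCAAT"
print(f"{gc_content(first_line):.0%}")
```

Applied to the complete sequence, the result should round to the value reported in the GC content field.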
 
Protein sequence
MGGPGLHLDD LKNNGPKRGN PTDRPRKTNE NQRERYPMFV KRLALAGAVS VAATAAYADG 
HASCDWENTT EVTMLSAAFE AWMAVTDGMA ACGNFAPELD QEFRTKQPEG FAASPSLYAL
GGVSNGTITP LLNQDTIRPL DDLIAAHGDS LTPNQLIQID GQTMAVAMMV NVQHLMYRED
ILADLGIETP STWEEVLTAA ETIQEAGVVD YPLGGTMKAG WNIAQEFVNM FSGFGGDFVN
DDNTPNVNTE AGLASLDMMM RLTEYMDPEF LVSDSTYVQQ QFQQGSIAMA NLWASRGGAM
DDEAESQVVG LVASAAAPLA VDGGPPATTL WWDGIVVAAN ITDEEADNAF RLAMEGIDSE
TVAAAPDAAI WLIEGYEPND MAAGAIASAT ANPAPPAYPS TTQMGLMHTA LGNELPAFFT
GDVSAEEALA AVEAAYTVAA QEAGLLE
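
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes the gene sequence is given in the coding orientation and contains only A, C, G, T; it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs from the standard table in its permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# The first ten codons and the last eight codons of the gene sequence above.
print(translate("ATGGGCGGCC CTGGGCTGCA TCTAGACGAC"))  # MGGPGLHLDD
print(translate("CAGGAAGCGG GTCTGCTGGA GTAA"))        # QEAGLLE
```

The two fragments match the beginning and the end of the protein sequence listed above.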