Gene Jann_1356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_1356 
Symbol 
ID3933803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp1320088 
End bp1321344 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content62% 
IMG OID637903706 
Productextracellular solute-binding protein 
Protein accessionYP_509298 
Protein GI89053847 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0142784 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.791743 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCATGA AGAAACTTAC CGCCAGCCTG CTGGCCACAA CCATGCTGGT CGGCACCGCC 
GCATCTGCGC AGGATGTGAC GCTGACGATC GAAAGCTGGC GCAACGACGA CCTGACGCTT
TGGCAGGACG TCATTATCCC GGCGTTTGAA GCCGAAAACC CCGGCATCTC GGTTCAGTTC
ACGCCGTCTG CGCCTGCGGA ATACAACGCC GTCCTGAACT CGAAGCTGGA CGCAGGCTCC
GCCGGTGACC TGATCACCTG CCGTCCGTTT GATGCGTCCC TCGCGCTCTA TGAGGCGGGT
CACCTGACCG ACCTCAGCGA TCTGGACGCG ATGGCCAACT TCTCGGACGT GGCGCAATCC
GCATGGCAGA CCGATGATGG CGCGGCGACC TTCTGCATGC CGATGGCCTC CGTGATCCAC
GGCTTCATCT ACAACGCCGA CGCCTTCGCG GAGCTTGGCT TGGAAGAGCC CACGAACGTT
GACGAATTCT TCGCCGTGCT CGACGCGATT GAGGAGGACG GCAATTACAT CCCGATGGCC
ATGGGCACCG CCGACCAGTG GGAAGCGGCC ACCATGGGCT ACAACAATAT CGGGCCAAAC
TACTGGCGCG GCGAAGAAGG TCGCCTCGCC TTGATTGCCG GTGAACAATC ACTCACGGAT
CCCGAATGGG TCGGACCGCT GGAACAGCTT GCACGTTGGG GCGATTACCT CGGCCGCGGC
TATGAGGCAC AGACCTATCC GGATAGCCAA AACCTGTTCA CCTTGGGGCG CGCCGCGATT
TATCCGGCAG GCAGCTGGGA AATCACCGGC TTCAACGCGC AGGCCGACTT TGCTATGGGT
GCCTTCGCCC CGCCGGTGCC GAACGCGGGC GATGAGTGTT TCATCTCGGA TCACACCGAT
ATCGCCATCG GTCTGAACGC CGCCTCCCCC AATGCCGAAG CCGCACGCAC CTTCCTCAAT
TGGGTTGGTT CGGCTGAGTT TGCCTCCATC TACGCCAACG CGCTGCCGGG CTTCTTCCCG
CTGTCAAACG CCGAGGTTGA ACTGGAAGAT CCGCTGGCGC AGGAAATGAT TTCCTGGCGA
GGGGAGTGTG AAAGCTCGAT CCGGTCCACC TACCAGATCC TGTCGCGCGG CACGCCGAAC
CTGGAAAACG AGACGTGGAA CGCCTCCACC CAGGTGATCC GTGGCGCAGA AGCCCCCGCC
GACGCCGCCG CGCGTCTGCA GGAAGGCCTC GCCTCCTGGT ACGAACCGCA GCAGTAA
 
Protein sequence
MSMKKLTASL LATTMLVGTA ASAQDVTLTI ESWRNDDLTL WQDVIIPAFE AENPGISVQF 
TPSAPAEYNA VLNSKLDAGS AGDLITCRPF DASLALYEAG HLTDLSDLDA MANFSDVAQS
AWQTDDGAAT FCMPMASVIH GFIYNADAFA ELGLEEPTNV DEFFAVLDAI EEDGNYIPMA
MGTADQWEAA TMGYNNIGPN YWRGEEGRLA LIAGEQSLTD PEWVGPLEQL ARWGDYLGRG
YEAQTYPDSQ NLFTLGRAAI YPAGSWEITG FNAQADFAMG AFAPPVPNAG DECFISDHTD
IAIGLNAASP NAEAARTFLN WVGSAEFASI YANALPGFFP LSNAEVELED PLAQEMISWR
GECESSIRST YQILSRGTPN LENETWNAST QVIRGAEAPA DAAARLQEGL ASWYEPQQ