Gene Jann_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_2047 
Symbol 
ID3934500 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp2051067 
End bp2052656 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content61% 
IMG OID637904403 
Productextracellular solute-binding protein 
Protein accessionYP_509989 
Protein GI89054538 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.101703 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.665186 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTGT TCAAACCGGC CCTGCTGTCG GCGATGCTGG CGCTGCCGCT GGCAGCCCCC 
GCGGCCCTTG CGGAAACGCC GCCCGAGATC CTGGTGGTCG CGCAGAATAT CGACGACATC
GTCGCGATTG ACCCGGCTCA GGCCTATGAG TTCACCTCTG GCGAGTTGGT GACGAACACC
TATGACCGTC TTGTGCAATA CGATGCCGAA GACACGACCG TGCTGGCCCC CGGTCTTGCC
ACGGAGTGGG AGATTGACGC CGAGGCCAAG ACCATCGTCT TCACCATGCG CGAGGGTGTG
ACGTTCCATT CCGGCAATGC GTTCACGGCG GATGACGTGG TTGGCTCCTT CGCCCGCGTG
GTGCAGTTGA ACCTGACGCC TGCGTTCATC CTGACGCAGC TTGGCTGGAC GCCCGAAAAC
GTCGAGGAGA TGGTGACAGC GGACGGCAAC ACTGTGACCG TGCGCTATGA TGGCGACTTT
TCCCCGGCCT TCGTGATGAA CGTGCTGGCG TCGCGGCCTG CCTCCATCGT GGATATGGAA
ACGGTCATGG CCAACGCGGT TGACGGCGAT ATGGGCAATG CGTGGCTCAA CGCCAACACG
GCGGGCACGG GGCCGTTTTC GCTCAACACC TTCCGCGCGG CAGAGCTGAT CCGCATGGAT
GCCAACCCTG ACTATTTCAA CGGTGCCCCC GCCATCGAGG GTGTAATCAT CCGCCATGTC
GCGGAATCGG CGACACAGCA ATTGTTGCTG GAAGCGGGTG ATGTTGATAT TGCGCGCAAC
TTGACGCCGG ACCAGATCGC GTCACTTGGC GGGGACGAGT TGCAGGTGGA GACGTTCCCA
CAAGCAGCCG TCCACTTCCT GTCGTTCAAC CAGGCGGTCG AAAGCCTAAC GCCCCCCGCC
GTATGGGAAG CCGCGCGCTA TCTGGTGGAT TACGAGGGGA TGACCAACTC GATCATCGCA
GGCCAGATGG AAATCCATCA GGCGTTCTGG CCAGAAGGGT TCCCCGGCGC GTTGACCGAC
ACGCCCTACA CCTATGATCC GGAGCGCGCC GCGCAGATCC TGGAAGATGC AGGGATTGAG
CTGCCGATCA CCGTCACGCT CGATGTGATC AACGCAGCAC CCTTCACCGA TATGGCGCAA
TCGTTGCAGG CGTCTTTCGC CGAAGCGGGC ATCGAGTTCG AAATCCTGCC CGGCACCGGA
TCACAGGTGA TCACCCGCTA CCGCGACCGC AGCCATGAGG CGATGTTGCT CTACTGGGGC
CCGGACTTCA TGGATCCCCA TTCCAACGCC AAAGCCTTCG CCTACAATTC CGACAATCGG
CAGGAAACCT ACACCGCCAC GACGACATGG CGGAACTCCT GGGCGGTGCC GGAAGAGATG
AACGCGATGA CGACGGCGGC CCTGACGGAA TCCGATCCGG CTGTGCGTGA AGAGATGTAT
CTGGAGCTTC AGCGGCAGGT GCAGGCGAAC TCGCCCATCG TGATCATGTT CCAGGCCTCC
TATCAGGTGG GTATGGCCGA GAATGTGTCA GGCTACGTGA ATGGTGCGAC GTCTGACTTC
GTGTTCTACC GGCTTGTCGA CAAAAGCTGA
 
Protein sequence
MKLFKPALLS AMLALPLAAP AALAETPPEI LVVAQNIDDI VAIDPAQAYE FTSGELVTNT 
YDRLVQYDAE DTTVLAPGLA TEWEIDAEAK TIVFTMREGV TFHSGNAFTA DDVVGSFARV
VQLNLTPAFI LTQLGWTPEN VEEMVTADGN TVTVRYDGDF SPAFVMNVLA SRPASIVDME
TVMANAVDGD MGNAWLNANT AGTGPFSLNT FRAAELIRMD ANPDYFNGAP AIEGVIIRHV
AESATQQLLL EAGDVDIARN LTPDQIASLG GDELQVETFP QAAVHFLSFN QAVESLTPPA
VWEAARYLVD YEGMTNSIIA GQMEIHQAFW PEGFPGALTD TPYTYDPERA AQILEDAGIE
LPITVTLDVI NAAPFTDMAQ SLQASFAEAG IEFEILPGTG SQVITRYRDR SHEAMLLYWG
PDFMDPHSNA KAFAYNSDNR QETYTATTTW RNSWAVPEEM NAMTTAALTE SDPAVREEMY
LELQRQVQAN SPIVIMFQAS YQVGMAENVS GYVNGATSDF VFYRLVDKS