Gene Jann_2053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_2053 
Symbol 
ID3934506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp2057403 
End bp2059142 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content62% 
IMG OID637904409 
Productextracellular solute-binding protein 
Protein accessionYP_509995 
Protein GI89054544 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0131056 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.653692 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA CAACTCAGCT GCTTTTGGGT ACGGTCGCGA TGACGCTGGC CCTGACCGGC 
GGCGCGGTGG CGCAGGATGT GGCGCGCGAG GATACGGTGA TCTTCGATCT CGACCGCACG
ATCCGGGACC CTGAAAACTT CAACTGGATG ACCGACGGCA CCGGCATTCG CCGCATGCAC
GGTGCGCATC AGGCAGTGTG GGAGCCTCTG TTCATCCTCA ACTATAACAC CGGTGAGCTG
GACCCGTGGC TTGCGACCGG CTTTGATGCC AACGACGACA GCACCGAGTT CACCATCACC
CTGCGCGAGG GCGTGGAATG GTCGGATGGA GAGGTATTCA ACGCCGAAGA TGTCGTCTTC
ACCGTGGAGA TGGCGCTTGG CAACGAAGAG TTGAACGCAC GCGAAGTCGC AACCCTGCGG
GGGCAGGTGG CCTCTGTCGC GATGGTGGAT GACCTGACCG TCACCTTCAC GCTCAACGCG
CCGAACCCGC GCTTCGTGTT GGAAAACTTC GGCGTGCGCA CGTTCGGCTC GTTCCTGATC
ATGCCGGAGC ATATCTGGTC GGCGGCAGAA AACCCCGCAA CCTTCACGTT CAACCCGCCC
GTGGGCACCG GACCCTATAC CTTCACGTCC GCCGCCACGA ACCGCGCGAT CTGGGACCGC
GACGATGACT GGTGGGGCGT TGACGCGGGC TTCATGGATC TGCCAGAGCC GCTGCGCGTT
GTGTTCCTGG AATCGGGCGG CGAAGAAAGC CGCGCACAAT TGCTGGCTAC AAATGGGTTG
GACGCAGCGC AGAACATGTC CGTCGGCACG TTCGAGGCGG TGGGCTTCCA GAACGAAAAC
GTCATGGCGT GGTACGCGGA TTATCCTTTC GCCGTGGCTG ATCCCTGCGC GCGGCAGTTG
GAGATCAACA CCACGGTCGC ACCCTGGGAT AACGCCAACA TGCGCCGGGC GGTGAACCTG
ATCATTGATC GGGAGCAGAT CGTGAACATC GCCGCTGAAG GTGCGACGAC GGCCTCTACG
ACGATGTTCG CGCAGTTCGG CTCCATGTCG CCGTTCATTG ATGCGGTGGT AGAGGCCGGT
CATGGCCTCT CTGCCTCCGC CGATGTGGAG GCCGCGCAGG CTTTGCTGGA AGGCGAAGGC
TGGATGCGTG ACGGCGATTA CTACGCCAGG GATGGTGAGA CGCTGAGCGT GTCGATCCAC
GTCAACTCCG CCTCTACTGA ATACACACGC ACCATTGACG TGATCGTGGA GCAGTTGCAG
CGGGCAGGGA TCGACGCGCG GTCGGTCCCG GTTGAAAACT CCGTCTTCTG GGGTGAAGTG
CTGCCCTTTG GCGGGTTCGA GATGTCCTAC AGCTGGCTGT CATGCGGCTC GGTCAATGAG
CCATGGGCCT CCATGGGGCG CTACACGACG GCAGATGTGG TGCCGGTGGG CGAACGCTCC
CCCGGCTTCA ACAACACGCC CCGTTGGGAT ACGGCGGCCG CCGAAAGCTA CACCGCGATC
GTGAACGACA TGGCCGCACT GCCCCTGGGC GATCCTGCGG TGCCGGGCAT GGTGGCGGAG
GCATACCAGT ATCTGGACGC CGAAATGCCA TTCATCCCAC TGGTGCAGGC CTACAAGCTG
ATGCCCTTCA GCACGACCTA TTGGGAGGGC TGGCCGTCTG CGGACAATTA CTACAACCAC
CCGTTCTTCC ACTGGAATTC GGGCCATCAG ATCATCCACA ACCTGACGCG CGTGGAATGA
 
Protein sequence
MKKTTQLLLG TVAMTLALTG GAVAQDVARE DTVIFDLDRT IRDPENFNWM TDGTGIRRMH 
GAHQAVWEPL FILNYNTGEL DPWLATGFDA NDDSTEFTIT LREGVEWSDG EVFNAEDVVF
TVEMALGNEE LNAREVATLR GQVASVAMVD DLTVTFTLNA PNPRFVLENF GVRTFGSFLI
MPEHIWSAAE NPATFTFNPP VGTGPYTFTS AATNRAIWDR DDDWWGVDAG FMDLPEPLRV
VFLESGGEES RAQLLATNGL DAAQNMSVGT FEAVGFQNEN VMAWYADYPF AVADPCARQL
EINTTVAPWD NANMRRAVNL IIDREQIVNI AAEGATTAST TMFAQFGSMS PFIDAVVEAG
HGLSASADVE AAQALLEGEG WMRDGDYYAR DGETLSVSIH VNSASTEYTR TIDVIVEQLQ
RAGIDARSVP VENSVFWGEV LPFGGFEMSY SWLSCGSVNE PWASMGRYTT ADVVPVGERS
PGFNNTPRWD TAAAESYTAI VNDMAALPLG DPAVPGMVAE AYQYLDAEMP FIPLVQAYKL
MPFSTTYWEG WPSADNYYNH PFFHWNSGHQ IIHNLTRVE