Gene Jann_4134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_4134 
Symbol 
ID3936623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp4243673 
End bp4245223 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content61% 
IMG OID637906520 
Productextracellular solute-binding protein 
Protein accessionYP_512076 
Protein GI89056625 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAAAT CTACCCATTT GCTTGCAGCC GGTGCTGCCA GCTTGGCCCT CACGACGGGC 
GCGTTTGCCC AGGCGGACGA TACGCTTGTT GTCGGCCTCA GCACAGATAT CAGCACGCTG
GACCCCGCGC CGATCTCTTC GCGCGACAAT TCCAACATCG CGCGCCACAT CTTCGGCACG
CTCTATACGA TGAGCGGCGA TGGTGAGGCC CTGCCGGTGC TGGCCAACAA CCTTGAGATC
TCCGACGATG GGCTGGCCTA CGTCTATACA CTGAATGAGG GTCTGACCTG CCACGATGGC
GAAGCGCTGA CAGCGGAAGA TGCGGCCTAT TCCTTCAACC GCGCTGCTGA TCCGGAAAAT
GCCTTCACCG GCAACACGCC CGGCTTCATC TTTTCCTCGA TCGATTTTCA AGGCGCAGAA
GCGCTGGACG AGCTGCGCGT GCAGGTGAAT ATCGGCGCGC CAAACCCGAT TGCCTTCGGC
CTGATCGCGG AGGTGTTCAT CCACTGCATG GACAGCTATG AGGCGATGAC CCTTGATGAG
GCGGCCAGCA ATCCGATTGG ATCCGGCCAC TACCGCCTGG TGGAATGGCG TCGCGGTTCC
GAGGTCGTGC TTGAGGCTGT CGAAGGCGCA GAGGTGGGCT TCCAGAACCT CGTCTGGCGG
ATCATCCCCG AAGCCTCCAC TCGCGCGGCG GAACTGATGG CCGGTTCCGT TGACATCATC
ACCAATGTGG CCCCGGACCA GATGGATGTG ATCAACGCAT CGGGCGCGGC AGAGGTGAAC
CCGATCCAGG GCACACGCCG CATGTATGTG GGCTTCAACC TGTCCGAGAC GATGGCGCAG
GAACCCGGCG GCGACGCGAT CCAGGATCCG GCCGTGCGTC GGGCGCTGCA ATATGCGGTC
AACGTTCCGG CGATTTGTTC GCAGCTTCTG AACTTTGAGT GTGAGCGGAT GACCGGCATC
GTGAACCCGC CCAATGCCAA CCAGAGCCTG GAGCCCTACC CCTATGATCC GGAAGAGGCC
GAGCGTCTGC TGGATGAGGC GGGTTGGCCC CGGGGCGACG ATGGCACCCG CTTCTCAATC
GCCTTCCAAG CCGGTCAGGG TCGTTACCTG AATGACGCCA ACGTCGTGCA GGCCATCGCC
CAATATCTCA GCGATGTGGG TCTGGATGTC GACTTGCAGA TCATGGAATG GTCCAGCGTC
TACATTCCGA TCATCCGGGA ACGCAATGCT GGCCCGCTCT ACTTCATCGG CTCCGGTGGG
GCGTTGTGGA GCCCGCTCTA TGACATGACC GACCTGGCCG CTGTCGATTC AGGCACCAAC
TACACCCATT GGGATGACCC CCGCTGGTTT GACCGTTGGT CCGACATCGC CGCCGCCGAG
ACGGAGGAGG AGACCCGCGA GATCGTGGAT GAGATGTTGC AGGTCTTCTA CGATGATGGC
CCCTGGCTGC ACCTCTACTT CCAGCCGGAC TTCTACGGTG TGTCCAACCG CGTGAACTGG
ACCCCGCGGC CCGATGAGAA GGTTTACCTC TGGGATGCGA CGCTGAACTA A
 
Protein sequence
MIKSTHLLAA GAASLALTTG AFAQADDTLV VGLSTDISTL DPAPISSRDN SNIARHIFGT 
LYTMSGDGEA LPVLANNLEI SDDGLAYVYT LNEGLTCHDG EALTAEDAAY SFNRAADPEN
AFTGNTPGFI FSSIDFQGAE ALDELRVQVN IGAPNPIAFG LIAEVFIHCM DSYEAMTLDE
AASNPIGSGH YRLVEWRRGS EVVLEAVEGA EVGFQNLVWR IIPEASTRAA ELMAGSVDII
TNVAPDQMDV INASGAAEVN PIQGTRRMYV GFNLSETMAQ EPGGDAIQDP AVRRALQYAV
NVPAICSQLL NFECERMTGI VNPPNANQSL EPYPYDPEEA ERLLDEAGWP RGDDGTRFSI
AFQAGQGRYL NDANVVQAIA QYLSDVGLDV DLQIMEWSSV YIPIIRERNA GPLYFIGSGG
ALWSPLYDMT DLAAVDSGTN YTHWDDPRWF DRWSDIAAAE TEEETREIVD EMLQVFYDDG
PWLHLYFQPD FYGVSNRVNW TPRPDEKVYL WDATLN