Gene Jann_1420 details


Gene Information

Locus tag: Jann_1420
Symbol:
ID: 3933867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 1388400
End bp: 1390271
Gene Length: 1872 bp
Protein Length: 623 aa
Translation table: 11
GC content: 58%
IMG OID: 637903770
Product: thiamine pyrophosphate enzyme protein
Protein accession: YP_509362
Protein GI: 89053911
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3962] Acetolactate synthase
TIGRFAM ID:
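
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 1388400
END_BP = 1390271
GENE_LENGTH_BP = 1872
PROTEIN_LENGTH_AA = 623


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming an unspliced CDS with a terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span covers 1872 bp, which is 624 codons, or 623 residues once the stop codon is excluded.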


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.406796
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.238501
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAAC CCACATCTAC CATTCGGCTG ACGGTAGCCC AGGCCATTGT GCGTTACCTC 
ATGAATCAGT TCATCGAGAT TGATGGCGTC GAGACCCGGA TCTGCGGAGG CGGATTCGGC
ATTTTCGGGC ACGGCAACGT GCCATGCCTG GGGGAAGCGC TTTACCCTGT GCAGGACGAG
ATGCCGCTTT ACCGTGGGCA GAACGAGCAA AGTATGGGGT TTGCCGCCGC CGCTTACGCG
AAATATCATC TGCGTCGCCG CTTCATGTTT TGCACGGCAT CAGCCGGGCC CGGAACGGCA
AACCTGCTTA CGGCATCTGC CTTGGCGCAC GCAAATCGCT TGCCATTGCT GATGTTGTGC
GGCGATACCT TTCTGACCCG TCTGCCCGAT CCGGTCCTGC AACAACTGGA GCACTTCGGA
AACCCAACGC TTGGCGTGAA CGATGCATTC AAGGCCGTGA CGCGCTTCTG GGATCGAATC
ACGCATCCCG CACAGCTGCT GAATGCGCTA CCTGCCGCCC TTGCGACGAT GCTGGACCCT
GCCGATTGCG GGCCCGCGTT CCTCGGATTG CCGCAAGATG TCCAGGGGTG GGCGTATGAT
TATCCCGAGG TCTTCTTCGC CCGCCATGTG CATCGCATCC GGCGACAAGC ACCTGATCCG
GCTGAGGTTT TTGATGCCGC AGCCCTTTTG GTTAACGCAA GGCGACCCGT GATCATCGCG
GGCGGCGGCG TGCAATATGC TGGCGCAGTG GACGCGCTGA CCCAATTTGC CGACACACAC
AACATTCCCG TGGTGGAGAC CATCGCAGGC CGCGCTAACA TGGTGGCAAC AAATCCGTTA
AATATCGGGC CCTTGGGTGT AACCGGGTCA GATTCAGCCA ACGCCATCGC CGCCGAGGCC
GATGTGATTT TGGCCGTTGG CACACGGCTT CAGGACTTCA CAACAGGTTC ATGGACAGCG
TTCGCCCATG ATGCGCAGAT CATCGGCATG AATGTTGGCC GTCATGATGC AGCAAAACAT
CTGTCTTTAC CGGTCGTTGG CTGTGCAAAA CTCAGCCTCC CCGCATTAAG CGCGGCACTG
TCAGACTACT CCGCGCCCGA GGCCTGGATG ACAAAAGCGC AGGCAGGACG CGCAAGTTGG
GACGCTTACG TGGTCGAGAA CGTGGCGCAC GGGAACCGCC CTAATTCCTA CGCGCAGGCC
ATCGGTGTGG TGAATGCGCT TTGCGACACA CGGGACCGCG TTGTCACGGC TGCGGGAGGG
CTTCCGGCCG AGGTCACTGC CAACTGGCGA ACGCTCGACA TTGGCACCGT CGATGTGGAG
TTCGGTTTCT CCTGTATGGG CTATGAGATC GCTGGTGGAT GGGGTGCAAA AATTGCACAA
TCCGAGCAAG AGCCCACAGC GGATACCATT GTTTTCGTCG GAGACGGTAG CTATCTCTTG
ATGAATTCCG ATATCTACTC AAGCGTTCTG ACTCGGAAGA AACTTATCGT TCTGGTCCTC
GACAACGGTG GTTTTGCGGT CATCAACAAG CTGCAAAACA ATACCGGAAA CGAGAGTTTC
AACAACCTTA TCGCGGATTG CCCCACTATA CCGGAGCCGT TCACGGTCGA CTTCGAAGCT
CACGCGCGCG CCATGGGGGC GCATGCGGAA ACCGTGTCCA ACCCAGCAGA ACTGGCAGAT
GCATTCCAGC GGGCGAAGAC GGCGGACAGG ACTTCTGTCA TCGTCATGAA GGTCGACCCC
TATGACGGCT GGACCACCGA AGGCCACACA TGGTGGGAAG TTGGGACGGC CCAGGTGTCC
GACAACCCGA ATGTCCGCGA AAAGCATGCG GAATGGGAAG CGGACCGTAA CAAGCAGCGA
CAGGGCGTGT GA
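
The GC content reported in the gene information can be recomputed from a sequence laid out like the one above. This is a minimal sketch, assuming the sequence is pasted as plain text with spaces and line breaks between 10-base blocks; `gc_fraction` is a hypothetical helper name, and the record's rounding rule is not stated, so the result is left as a fraction.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace and case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


if __name__ == "__main__":
    # First line of the gene sequence above; paste the full sequence
    # to compare against the reported GC content.
    first_line = "ATGACCGAAC CCACATCTAC CATTCGGCTG ACGGTAGCCC AGGCCATTGT GCGTTACCTC"
    print(f"{gc_fraction(first_line):.2%}")
```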
 
Protein sequence
MTEPTSTIRL TVAQAIVRYL MNQFIEIDGV ETRICGGGFG IFGHGNVPCL GEALYPVQDE 
MPLYRGQNEQ SMGFAAAAYA KYHLRRRFMF CTASAGPGTA NLLTASALAH ANRLPLLMLC
GDTFLTRLPD PVLQQLEHFG NPTLGVNDAF KAVTRFWDRI THPAQLLNAL PAALATMLDP
ADCGPAFLGL PQDVQGWAYD YPEVFFARHV HRIRRQAPDP AEVFDAAALL VNARRPVIIA
GGGVQYAGAV DALTQFADTH NIPVVETIAG RANMVATNPL NIGPLGVTGS DSANAIAAEA
DVILAVGTRL QDFTTGSWTA FAHDAQIIGM NVGRHDAAKH LSLPVVGCAK LSLPALSAAL
SDYSAPEAWM TKAQAGRASW DAYVVENVAH GNRPNSYAQA IGVVNALCDT RDRVVTAAGG
LPAEVTANWR TLDIGTVDVE FGFSCMGYEI AGGWGAKIAQ SEQEPTADTI VFVGDGSYLL
MNSDIYSSVL TRKKLIVLVL DNGGFAVINK LQNNTGNESF NNLIADCPTI PEPFTVDFEA
HARAMGAHAE TVSNPAELAD AFQRAKTADR TSVIVMKVDP YDGWTTEGHT WWEVGTAQVS
DNPNVREKHA EWEADRNKQR QGV
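
The protein sequence can be re-derived from the gene sequence using the translation table named in the record. The sketch below hard-codes the codon assignments of NCBI translation table 11, which are the same as the standard code for elongation; it does not model alternative start codons, and `translate` is an illustrative name rather than a library function.

```python
from itertools import product

# Codon assignments for NCBI translation table 11, listed in the
# conventional TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace and case are ignored; a trailing partial codon is dropped.
    """
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGACCGAAC CCACATCTAC CATTCGGCTG"))  # MTEPTSTIRL
```

The first 30 bases translate to MTEPTSTIRL, matching the first block of the protein sequence, and the final codons CAG GGC GTG TGA give QGV followed by a stop.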