Gene Jann_1492 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_1492 
Symbol 
ID3933939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp1462264 
End bp1463934 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content59% 
IMG OID637903842 
Productcholine dehydrogenase 
Protein accessionYP_509434 
Protein GI89053983 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID[TIGR01810] choline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGCAG ATTATGTCGT TATTGGAGCG GGATCAGCAG GTTGCGCGGT GACCTACAGG 
TTGGCAGAGG CCGGGAAATC TGTGCTCGTT GTCGAGCATG GCGGGTCCGA TTGGGGGCCA
TTCATCAACA TGCCCGCCGC GCTGAGCTAT CCGATGGGCA TGAAGCGCTA CGACTGGGGC
TACGTGACCG AGCCCGAACC GCACATGAAC AATCGCGTCA TGGCCTGTCC CCGTGGCAAG
GTGGTCGGAG GGTCCTCCTC AATCAACGGG ATGATCTACG TTCGCGGCCA TGCGCGCGAC
TTTGACACCT GGGCAGAAAT GGGAGCCGAT GGTTGGTCCT ATGCCGACGT GCTGCCTTAT
TTCAAACGCG CGGAGACCTG GCACGGCGAC GCCGGGGAGC CCGCCTTCCG AGGCAGTGAC
GGGCCGGTTC ACGTCACCCG GGGCACGCGC AAAAATCCGC TTTATCAGGC GTTTATCGAC
GCTGGTATGC AGGCTGGTTA TGGCGCGACG GACGATTACA ACGGATATCG GCAGGAGGGG
TTCGGCGCGT TTGAGATGAC CGTTTACAAA GGTAAACGGT GGTCTGCGGC CAGCGCCTAT
CTTAGGCCCG CACTTGCCAA ACCAAACTGC GATATGGTGC GTGGCCTGGT GCAACGGATC
GAGTTTAAGG AGGGCCGGGC AACGGGTGTT CGTCTGGCAG ATGGCAGCCT GATCCGAGTG
CGCTGCGAAG TGGTCCTGTG CGCCGGTGCG ATCAATTCAC CTAAAATTCT AATGCTTAGT
GGAATTGGGC CAGCCAAGCA TTTGGCAGAA CATGGTATTT CAGTGGTCGC GGACCGTGCG
GGCGTCGGTC AAAACCTGCA AGATCACCTG GAAATGTATA TCCAGTACGC CGCATCCAAG
CCTGTTTCGA TCGCACCCTA CTGGTCGCTC TGGGGAAAAG CGGCGGTGGG CGCGCAATGG
CTGTTCACCA AGACAGGGCT GGGGGCCACC AACAACTTTG AATCCTGTGG CTTCATTCGC
TCCAGCGCAG GGGTGGAATA TCCTGATATT CAATACCATT TTCTTCCAAT CGCGATCCGG
TATGACGGCC AGATGCCCCC GGGCGGGCAC GGTTTCCAAG CCCATACCGG CCCCATGCGG
TCCCCCTCCC GGGGTGAAAT CACGCTCCGC AGTCAAGACC CAGCGCAAGC TCCGAAAATC
CAGTTCAACT ACATGTCCCA CGAAAAGGAC TGGCGGGATT TCCGCCGCGC TATCCGCCTG
ACGCGCGAGA TCTTCGCCAC CGAACCCATG GCGGAGTACG TCGATCATGA GATCCAACCC
GGCGATGCCG CGCAATTCGA TGACGCCTTG GATGCGGTCA TTCGCGAGCA TGCTGAATCG
GCTTACCACC CCTGTGGAAC CGCACGAGTG GGGCAGAGAA ATGACCCAAT GTCGGTTGTA
GATCCCCAGA CCTCGGTGAT TGGCGTCAGC AGTTTGCGCG TGGCAGACAG TTCGATTTTT
CCTTTGATTC CAAATGGAAA CCTGAACGCT CCATCAATTA TGGTCGGGGA AAAGGCCGCG
GATCACATCC TGGGTCGCCG TGTGCCCGCC GAGAATCTGA CGCCTTGGAT CGCACCGGAT
TGGCAGTCGA CACAACGGGA AAGCCATGCC GTAAAGGCTG CGGCTGAATA G
 
Protein sequence
MEADYVVIGA GSAGCAVTYR LAEAGKSVLV VEHGGSDWGP FINMPAALSY PMGMKRYDWG 
YVTEPEPHMN NRVMACPRGK VVGGSSSING MIYVRGHARD FDTWAEMGAD GWSYADVLPY
FKRAETWHGD AGEPAFRGSD GPVHVTRGTR KNPLYQAFID AGMQAGYGAT DDYNGYRQEG
FGAFEMTVYK GKRWSAASAY LRPALAKPNC DMVRGLVQRI EFKEGRATGV RLADGSLIRV
RCEVVLCAGA INSPKILMLS GIGPAKHLAE HGISVVADRA GVGQNLQDHL EMYIQYAASK
PVSIAPYWSL WGKAAVGAQW LFTKTGLGAT NNFESCGFIR SSAGVEYPDI QYHFLPIAIR
YDGQMPPGGH GFQAHTGPMR SPSRGEITLR SQDPAQAPKI QFNYMSHEKD WRDFRRAIRL
TREIFATEPM AEYVDHEIQP GDAAQFDDAL DAVIREHAES AYHPCGTARV GQRNDPMSVV
DPQTSVIGVS SLRVADSSIF PLIPNGNLNA PSIMVGEKAA DHILGRRVPA ENLTPWIAPD
WQSTQRESHA VKAAAE