Gene Caul_5443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5443 
Symbol 
ID5897138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010333 
Strand
Start bp156092 
End bp157720 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content61% 
IMG OID641550730 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_001672216 
Protein GI167621708 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.078996 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGACT ATATTATTGT TGGAGCGGGG TCTGCCGGAT GCTTGTTGGC GGAGCGCTTG 
TCAGCCAATC CCAGGACGCG GGTCTGTCTG CTTGAGGCGG GCCCGCCCGA CCGCAGCCCG
CTGATCCACA TGCCCATTGG GATAGCGCTT CTGTCAAAGA GCAAAATTCT CAATTGGGCA
TTCGAGACGC AGCCACAGGC CAATCTCGAT GGTCGACGGC TGTTTTGGCC GCGCGGCAAA
ACCCTTGGCG GATCGAGTTC GATCAATGCG ATGGTCTATA TCCGCGGGCA CCGGGATGAC
TATGACTCCT GGGGCGAGGC AGCCGATCCG ATCTGGTCCT ATGACAATGT GCTCCCGCTG
TTCAAGGCGA TGGAGTCCAA CGAGAGATTT GGAACCGACG CGTTTCATGG CGGCGATGGT
GAGCTTCACG TCAGCGACCT GCGAACCCGC AACCCCTTGA GCGATGCCTT CGTCGAGGCC
GGACAACAGG CCCAGTTTCC GCATGCCGTC GATTTCAATG GGAAGATGCA GGACGGCGTC
GGCCTGTACC AGGTCACCCA GCACAAAGGC CGGCGCTGGA GTTCCGCGCG CGCCTTTCTT
TCCAAGGCCA AGGGCCGGCC CAATCTACGG ATAGTCACGG GCGCGCGGGC TACCCGGATC
ATTCTGGAGG GCCGCAAAGC GGTCGGCGTG ACCTATGCCG CAGGCGGCAA GCTGGTCGAT
GTGCGAACCA GGGGCGGCGA GGTCATTCTT TCGGGCGGCG CCGTCAATTC CCCGCAACTG
CTGCTGCTTT CCGGCATCGG CGGCGCGGCC GAGCTGAACG CACTCGGCAT TCCGGTGGTC
GTCGACCTTC CGGCAGTTGG AAAAAATCTG CAGGATCACC TCGATATCAC AATCATGCAT
GAGGCGAACG ATCGTACACC GATCGGCATC GCACCGTCAT TCATCCCGCG GGCGCTGTCC
GGAGCGCTAT CCTACGCCTT CCTTCGAAAG GGTTTCTTGA CGAGCAACGT CGCCGAGGCG
GGCGGCTTCG TCAAAAGCAC ACCTTCGCGG AGTCGGCCGA ATCTACAGTT TCATTTCCTC
CCCACGCTTT TGAAGGACCA TGGGCGCGAA ATGGCGTTCG GGTATGGCTA TACATTGCAT
GTCTGCGATC TTCTGCCCAA GAGCCGAGGC CGCATCGGGC TCACAAGCCC CGACCCGCTC
GACGATCCGC TGATCGATCC AAACTATCTC TCGGCCCCCG AAGACATTGA GACCATGGTC
GCGGCGGTGA AGATCGGCCG GCAAATTCTG TCGGCGCCGT CAATGGCGGC CTTCTCGAAA
ACCGAACTGG TCCCTGGGCC ATCGGTCCAG AGCAAGGCGG ATATCATGGC GGATATCCGT
CGGCGAGCGG AGACGATCTA TCATCCGGTG GGAACATGCC GGATGGGACG AGACCCTCAG
TCGGTTGTCG ATCCGTCACT CCGAGTGCGT GGCGTGCAAG GCCTTCGCGT CGTCGACGCC
TCGGTCATGC CGACGCTGGT CGCCGGAAAC ACCAACGCCC CGACGATGAT GATTGCGGAA
AGAGCTGCCG AGCTCATTCT TGGGAAGACG AAACTCGCAC TCAGCGCCAA CATTGAGGCA
TTCCGCTAA
 
Protein sequence
MFDYIIVGAG SAGCLLAERL SANPRTRVCL LEAGPPDRSP LIHMPIGIAL LSKSKILNWA 
FETQPQANLD GRRLFWPRGK TLGGSSSINA MVYIRGHRDD YDSWGEAADP IWSYDNVLPL
FKAMESNERF GTDAFHGGDG ELHVSDLRTR NPLSDAFVEA GQQAQFPHAV DFNGKMQDGV
GLYQVTQHKG RRWSSARAFL SKAKGRPNLR IVTGARATRI ILEGRKAVGV TYAAGGKLVD
VRTRGGEVIL SGGAVNSPQL LLLSGIGGAA ELNALGIPVV VDLPAVGKNL QDHLDITIMH
EANDRTPIGI APSFIPRALS GALSYAFLRK GFLTSNVAEA GGFVKSTPSR SRPNLQFHFL
PTLLKDHGRE MAFGYGYTLH VCDLLPKSRG RIGLTSPDPL DDPLIDPNYL SAPEDIETMV
AAVKIGRQIL SAPSMAAFSK TELVPGPSVQ SKADIMADIR RRAETIYHPV GTCRMGRDPQ
SVVDPSLRVR GVQGLRVVDA SVMPTLVAGN TNAPTMMIAE RAAELILGKT KLALSANIEA
FR