Gene Caul_2820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2820 
Symbol 
ID5900275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3061842 
End bp3063326 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content64% 
IMG OID641563312 
ProductNADH dehydrogenase subunit M 
Protein accessionYP_001684445 
Protein GI167646782 
COG category[C] Energy production and conversion 
COG ID[COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) 
TIGRFAM ID[TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.22208 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGCC TTCTCAGCCT GACCACCTTC GCCCCGCTCG TCGGTGTCGC CGTCCTGCTG 
GCCGTTCGCG CCCTCCAGGG CGACAACGCC AAGACCGACA GTCTGGCCAA GTGGATCGCC
CTGGCGACCA CCCTGGTCAC CTTCGCCCTG TCTATCATCC TGGTCGCCAA GTTCGACGCC
AGGAACCCCG GCTTCCAGTT CGTGGAAGAC GCGGCCTGGT TCGCGGGCCT GCACTACCGC
ATGGGCGTGG ATGGCATCTC GGTGCTGTTC GTCCTGCTGA CCGCGTTCCT GCTGCCGATC
TGCGTGATCG CCAGTTGGAA GTCGATCGAC AAGCGGGTCG TGGAATACAT GATCTGCTTC
CTGGTCCTGG AGACCCTGGT GATCGGCGTG TTCTGCGCCC TGGACCTGAT CCTGTTCTAC
CTCTTCTTCG AGGGCGGCCT GGTTCCGATG TTCCTGATCA TCGGCATCTG GGGCGGCAAG
CGCCGGGTCT ACGCAGCCTA CAAGTTCTTC CTCTACACCC TGCTCGGATC GGTGCTGATG
CTGGCGGCCA TCCTGTCGAT GATCGCCATC GCCCGCACCT CGTCGATCCC CGACCTGATG
CACTTCAAGT TCGCCCCGTG GCTTCAGACC TGGCTGTGGC TGGCCTTCTT CGCCAGCTTC
GCGGTCAAGA TGCCGATGTG GCCGGTGCAC ACCTGGCTGC CCGACGCCCA CGTCGAGGCG
CCCACGGCCG GTTCGGTGAT CCTGGCGGGT ATCCTGTTGA AGCTCGGAGG CTACGGCTTC
ATGCGCTTCA GCCTGCCGAT GTTCCCCAAC GCCTCGGAGC TGTTCCAGCC GCTGGTGTTC
GCGATGTCGG CCATCGCCAT CGTCTACACT TCGCTGGTCG CCTTCCGTCA GACCGACATC
AAGAAGCTGA TCGCCTATTC GTCGGTGGCC CACATGGGCT TCGTGACCAT GGGGATCTTC
TCGGGCAACG TCCAAGGCGA GCAGGGCGCG CTGTTCCAGA TGCTAAGCCA CGGGGTGATC
TCAGGCGCGC TCTTCCTCTG CGTCGGCGTG GTCTATGACC GGATGCACAC CCGCGAGATC
GCCTTCTACG GCGGCCTGAC CAACCGCATG CCCTGGTACG CGGCGGTGTT CATGCTGTTC
ACCATGGGCA ATGTCGGCCT GCCGGGCACC TCGGGCTTCG TCGGCGAGAT CCTGACCATG
GTCGGGACCT ACAAGGCCTC GACCTGGACG GCCTTGGTCG CCTCGACCGG CGTGATCCTG
TCGGCGGTCT ACGCCCTGAA CCTGTACCGC CGGGTGATGT TCGGAGAGAT CGTCAATCCC
GAGCTCAAGA CCATCGCCGA CCTCGACAAG CGCGAGATCC TGATCTTCGC GCCGCTGATC
ATCGCCACCC TGGTGCTCGG CGTTTACCCC AATCTCGTGT TCAACCTGAC CGCGTCGTCC
GTCGACGGGC TCGTCGGCGC CTGGCGCGCC GCCGTGGGCG GGTGA
 
Protein sequence
MTGLLSLTTF APLVGVAVLL AVRALQGDNA KTDSLAKWIA LATTLVTFAL SIILVAKFDA 
RNPGFQFVED AAWFAGLHYR MGVDGISVLF VLLTAFLLPI CVIASWKSID KRVVEYMICF
LVLETLVIGV FCALDLILFY LFFEGGLVPM FLIIGIWGGK RRVYAAYKFF LYTLLGSVLM
LAAILSMIAI ARTSSIPDLM HFKFAPWLQT WLWLAFFASF AVKMPMWPVH TWLPDAHVEA
PTAGSVILAG ILLKLGGYGF MRFSLPMFPN ASELFQPLVF AMSAIAIVYT SLVAFRQTDI
KKLIAYSSVA HMGFVTMGIF SGNVQGEQGA LFQMLSHGVI SGALFLCVGV VYDRMHTREI
AFYGGLTNRM PWYAAVFMLF TMGNVGLPGT SGFVGEILTM VGTYKASTWT ALVASTGVIL
SAVYALNLYR RVMFGEIVNP ELKTIADLDK REILIFAPLI IATLVLGVYP NLVFNLTASS
VDGLVGAWRA AVGG