Gene Caul_2833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2833 
Symbol 
ID5900288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3073022 
End bp3074272 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content64% 
IMG OID641563325 
ProductNADH dehydrogenase subunit D 
Protein accessionYP_001684458 
Protein GI167646795 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID[TIGR01962] NADH dehydrogenase I, D subunit 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.371933 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.138606 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGGTT CCAATTCCCC CGCCGCGGCG ACCGACTTCT TCCACGACGC GCCGTCGCTG 
CCGGCCGTGC CGGAAACCCC GGTGCGCAAG TTCAACATCA ACTTCGGCCC GCAACACCCG
GCCGCGCACG GTGTGTTGCG CCTAGTGCTG GAGCTGGACG GCGAGATCGT CGAGCGCGTC
GATCCGCACA TCGGCCTGCT GCATCGCGGC ACCGAGAAGC TGATGGAGGC CCGCACCTAC
CTCCAGAACA TCCCCTATTT CGACCGCCTC GACTACGTGG CGCCGATGAA CCAGGAACAT
GCGTTCTGCC TGGCCATCGA GAAGCTGCTG GGCGTGGATG TGCCGCTGCG CGGCAGCCTG
ATCCGCGTGC TGTTCTGCGA GATCGGCCGG GTGCTGAACC ACCTGCTGAA CGTGACGACC
CAGGCCATGG ATGTCGGCGC CCTGACGCCG CCGCTGTGGG GTTTCGAGGA GCGCGAGAAG
CTGATGGTGT TCTACGAGCG CGCCTGCGGC GCTCGCCTGC ACGCCAACTA CTTCCGCCCG
GGCGGCGTCC ACCAGGACCT GACCCCGTCG CTGATCGACG ATATCGAGAA GTGGGCGAAA
GCCTTCCCGA AGATCTGCGA CGATATCGAA GGCCTGATCA CCGACAACCG CATCTTCAAG
CAGCGCAATG TCGACATCGG CGTCGTGACC AAGGAAGACG CCCTGGCCTG GGGCTTCTCC
GGTGTGATGG TGCGCGGTTC GGGCATCGCC TGGGACCTGC GCCGCAACCA GCCCTACGAA
TGCTACAATG ACTTCGAGTT CGACATCCCG CTGGGCAAGA ACGGCGACTG CTACGATCGC
TATCTGTGCC GCATGCAGGA GATGCGCGAG TCGACCAAGA TCATCCTGCA GGCCATCGAA
AAGCTGCGCG CCACGCCCGG CCCGGTGATG ACGCAGGACA ACAAGGTGGC TCCACCGCGC
CGCGCCGAGA TGAAGCGGTC GATGGAAGCC CTGATCCATC ACTTCAAGCT CTATACCGAA
GGCTTCCGCA CGCCGGAGGG CGAGGTCTAC GCTTGCGTCG AGGCCCCCAA GGGCGAGTTT
GGCGTGTTCC TGGTGTCGAA CGGCACCAAC AAGCCCTATC GCTGCAAGAT CAAGGCGCCG
GGCTTCTCGC ACCTGGCGGC TATGGACTGG ATGAACCGCG GCCACCAACT GGCCGACGTC
TCGGCCATTC TGGGTTCGCT CGACATCGTG TTCGGCGAGG TCGACCGGTG A
 
Protein sequence
MTGSNSPAAA TDFFHDAPSL PAVPETPVRK FNINFGPQHP AAHGVLRLVL ELDGEIVERV 
DPHIGLLHRG TEKLMEARTY LQNIPYFDRL DYVAPMNQEH AFCLAIEKLL GVDVPLRGSL
IRVLFCEIGR VLNHLLNVTT QAMDVGALTP PLWGFEEREK LMVFYERACG ARLHANYFRP
GGVHQDLTPS LIDDIEKWAK AFPKICDDIE GLITDNRIFK QRNVDIGVVT KEDALAWGFS
GVMVRGSGIA WDLRRNQPYE CYNDFEFDIP LGKNGDCYDR YLCRMQEMRE STKIILQAIE
KLRATPGPVM TQDNKVAPPR RAEMKRSMEA LIHHFKLYTE GFRTPEGEVY ACVEAPKGEF
GVFLVSNGTN KPYRCKIKAP GFSHLAAMDW MNRGHQLADV SAILGSLDIV FGEVDR