Gene Caul_2828 details

Gene Information

Locus tag: Caul_2828
Symbol:
ID: 5900283
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3070308
End bp: 3071627
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 66%
IMG OID: 641563320
Product: NADH dehydrogenase I subunit F
Protein accession: YP_001684453
Protein GI: 167646790
COG category: [C] Energy production and conversion
COG ID: [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit
TIGRFAM ID: [TIGR01959] NADH-quinone oxidoreductase, F subunit
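
The start, end, gene length, and protein length fields are mutually consistent, as the short sketch below illustrates. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(3070308, 3071627)
print(length, protein_length(length))  # prints: 1320 439
```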


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.256146
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.149057
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGTCGGTA TCCTCGAAGA CAAGGACCGC ATTTTCACGA ACCTCTACGG TCTCCAGGAT 
TGGGGCCTTG AGGGCGCGAA GAAGCGCGGC TGCTGGAATG GCACCAAGGA CATCCTGGAC
GCCGGGCGCG ACTGGATCAT CGACAACATG AAGAACTCCG GCCTGCGCGG CCGGGGCGGG
GCGGGCTTCG GCACCGGCCT GAAGTGGTCG TTCATGCCCA AGGAAGTGAA GGACGGCCGT
CCGCATTATC TGGTCGTCAA CGCCGACGAA TCCGAGCCGG GCACCTGCAA GGACCGGGAG
ATCATGCGGC ATGATCCGCA CCTCCTGATC GAAGGCTGCC TGATCGCCTC GCGCGCCATG
CTAGCCCATG CCTGCTACAT CTACATTCGC GGCGAATATG TCCGCGAGCG TGAAGTGCTT
GAGGCAGCGA TCAAGCAGGC CTACGAGGCC AAGCTGATCG GCAAGAACAA CGTCCACGGC
TGGGACTTCG ACCTCTACGT CCACCACGGG GCCGGCGCCT ATATCTGCGG CGAAGAGACG
GCCCTGCTGG AAAGCCTGGA AGGCAAGAAG GGCCAGCCGC GCCTGAAGCC GCCGTTCCCG
GCCGGAGCGG GCCTCTACGG CATGCCCACC ACGGTCAACA ACGTCGAGAG CATCGCCGTG
GCCGGCACGA TCCTGCGTCG CGGCGCAGCC TGGTTCGCGG GCTTTGGCCG TCCGAACAAC
ACCGGCACCA AGCTCTTCTG CGTCAGCGGG CACGTGAACC TGCCCTGCAA TGTCGAAGAA
GCGATGAGCA TCCCGTTCCG TCAGCTGATG GAAGACCACT GCGGCGGCAT TCGCGGCGGC
TGGGGCAACC TGAAGGCCGT CATCCCGGGC GGTTCGTCCG TACCGATGAT CCCGGCCGAG
CAGTGCGAAG ACCTGCCGAT GGACTTTGAC GCCCTGCGCA ACCTGCGCTC GGGCCTTGGC
ACCGCCGCCG TCATCGTCAT GGACAAGGAC ACAGACCTCG TCCGCGCCAT CGCCCGCCTG
AGCTACTTCT ACAAGCACGA GAGCTGCGGC CAGTGCACGC CGTGCCGCGA AGGCACCGGC
TGGATGTGGC GGGTCATGGA GCGCATGGCC ACCGGCGAGG CCGATCCGAA AGAGATCGAC
ACCCTGCTGG ACGTCACGAC CCAGGTCGAG GGTCACACCA TCTGCGCCCT GGGCGACGCG
GCCGCCTGGC CGATCCAGGG CCTGTTCCGT CACTTCCGCC ACGAGGTGGA GGACCGGATC
GCATCCTATC GTAGCGGTCG CCTGCACGTG CAGGGCGCCA GCCTGATCGC GGCGGAGTAA
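
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be stripped before computing anything from it. A minimal sketch of how the GC content field could be recomputed follows. The helper names are hypothetical, and how the database rounds its reported percentage is an assumption.

```python
def clean_sequence(text: str) -> str:
    """Remove block spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C in a (possibly blocked) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGGTCGGTA TCCTCGAAGA CAAGGACCGC ATTTTCACGA ACCTCTACGG TCTCCAGGAT"
print(len(clean_sequence(first_line)))   # 60 bases per printed line
print(f"{gc_fraction(first_line):.0%}")  # GC content of this line only
```

Passing all 22 sequence lines as one string gives the GC content of the whole gene, which can be compared with the reported value.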
 
Protein sequence
MVGILEDKDR IFTNLYGLQD WGLEGAKKRG CWNGTKDILD AGRDWIIDNM KNSGLRGRGG 
AGFGTGLKWS FMPKEVKDGR PHYLVVNADE SEPGTCKDRE IMRHDPHLLI EGCLIASRAM
LAHACYIYIR GEYVREREVL EAAIKQAYEA KLIGKNNVHG WDFDLYVHHG AGAYICGEET
ALLESLEGKK GQPRLKPPFP AGAGLYGMPT TVNNVESIAV AGTILRRGAA WFAGFGRPNN
TGTKLFCVSG HVNLPCNVEE AMSIPFRQLM EDHCGGIRGG WGNLKAVIPG GSSVPMIPAE
QCEDLPMDFD ALRNLRSGLG TAAVIVMDKD TDLVRAIARL SYFYKHESCG QCTPCREGTG
WMWRVMERMA TGEADPKEID TLLDVTTQVE GHTICALGDA AAWPIQGLFR HFRHEVEDRI
ASYRSGRLHV QGASLIAAE
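
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below uses the standard codon assignments, which translation table 11 shares with the standard code (the two tables differ in the start codons they permit, and this gene starts with ATG). The `translate` helper is hypothetical and stops at the first stop codon.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence up to, but not including, the first stop."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and the last line of the gene sequence in this record.
print(translate("ATGGTCGGTA TCCTCGAAGA CAAGGACCGC"))  # MVGILEDKDR
print(translate("GCATCCTATC GTAGCGGTCG CCTGCACGTG CAGGGCGCCA GCCTGATCGC GGCGGAGTAA"))
```

The last line of the gene sequence is in frame because each preceding line holds 60 bases, a multiple of three.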