Gene Cpha266_0965 details


Gene Information

Locus tag: Cpha266_0965
Symbol:
ID: 4570734
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 1105508
End bp: 1106626
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 48%
IMG OID: 639765568
Product: respiratory-chain NADH dehydrogenase, subunit 1
Protein accession: YP_911437
Protein GI: 119356793
COG category: [C] Energy production and conversion
COG ID: [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H)
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00465289
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGTAA TGGCTTTATC GCAAATCCGT ATTCCTCTGC TTATGGGTAA CAGTCTCAAT 
GCCTGGTCGG AAGCCCTTAC CGGTTTTTCG ATCTGGGGAT TTCCTCTTGG TCTTGTCATT
CTTGCCGCCA TTCCGTTAGT TTTTATTGCG CTTTACGCTC TGACATACGG AGTCTACGGC
GAACGGAAAA TTTCCGCATT CATGCAGGAC AGGCTTGGTC CGATGGAGGT TGGCAAATGG
GGTATTCTGC AGACCCTTGC CGATATTCTC AAGCTTTTGC AGAAAGAGGA TATTGTTCCT
GCCGCTGCTG ACAAATTTCT TTTTGTCGTT GGCCCCGGAA TTCTGTTTGT CGGCTCCTTT
CTTGCATTTG CCGTGCTTCC GTTCAGTTCT GCTTTTATTG GTGCCAATTT AAATGTAGGC
CTCTTTTATG CAATCGGCAT CGTATCCATT GAAGTGGTCG GTATTCTTGC TGCCGGCTGG
GGATCAAACA ACAAGTGGTC GCTCTATGGA GCGGTTCGGA GTGTCGCCCA GATAGTCAGC
TATGAAATTC CTGCCGGAAT TGCCCTTTTG TGCGGAGCCA TGATGGCAGG AACGCTTGAT
ATGCAGCAGA TAACAATGCT CCAGTCCGGT CATCTCGGGT TTGCCCATTT CAATCTTTTT
CAGTCGCCGA TTGCCTGGCT TCCTTTTCTG ATCTATTTCA TCGCTTCGCT TGCAGAGGTT
AATCGGGCCC CTTTTGATAT TCCCGAAGCC GAATCCGAGC TTGTTGCCGG TTATTTTACC
GAGTATAGCG GGATGAAATT TGCGGTTATT TTTCTTGCCG AATATGGTAG TATGTTTATG
GTTTCAGCCG TTCTCTCCAT TGTTTTTCTT GGAGGCTGGA ACTCGCCTCT TCCCGATCTT
GGCCCTGTAT CGCTCAATGC CATGACAAGT GGCCCTGTGT GGGGGGTCTT CTGGATTATT
TCGAAGGGAT TTTTCTTTAT TTTTGTGCAG ATGTGGCTGC GCTGGACCCT GCCTCGTTTG
AGGGTTGATC AGTTGATGTA CCTCTGCTGG AAAGTTCTGA CACCGTTCGC TTTTATCGGA
TTTGTTCTGA CGGCGATCTG GGAAATTTAT GTGCCATAG
 
Protein sequence
MSVMALSQIR IPLLMGNSLN AWSEALTGFS IWGFPLGLVI LAAIPLVFIA LYALTYGVYG 
ERKISAFMQD RLGPMEVGKW GILQTLADIL KLLQKEDIVP AAADKFLFVV GPGILFVGSF
LAFAVLPFSS AFIGANLNVG LFYAIGIVSI EVVGILAAGW GSNNKWSLYG AVRSVAQIVS
YEIPAGIALL CGAMMAGTLD MQQITMLQSG HLGFAHFNLF QSPIAWLPFL IYFIASLAEV
NRAPFDIPEA ESELVAGYFT EYSGMKFAVI FLAEYGSMFM VSAVLSIVFL GGWNSPLPDL
GPVSLNAMTS GPVWGVFWII SKGFFFIFVQ MWLRWTLPRL RVDQLMYLCW KVLTPFAFIG
FVLTAIWEIY VP
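
Consistency check (illustrative)

The fields in this record can be cross-checked against each other: End bp minus Start bp plus one gives the 1119 bp gene length, and 1119 bp is 373 codons, that is, 372 amino acids plus the terminal TAG stop codon. The sketch below is one possible way to run such checks; it is not part of the database record. The helper names (clean, span_length, gc_content, translate) are hypothetical, and the codon table is written out by hand on the assumption that translation table 11 assigns the same amino acids to codons as the standard code (it differs in its set of permitted start codons, which does not matter here because the gene starts with ATG).

```python
# Minimal sketch for sanity-checking a gene record. All helper names are
# illustrative assumptions, not part of any database tool.

BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycle T, C, A, G).
# Assumption: table 11 shares these codon-to-amino-acid assignments with the
# standard code; only the allowed start codons differ.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Coordinates from the record: 1106626 - 1105508 + 1
print(span_length(1105508, 1106626))  # 1119

# First 30 bases of the gene sequence, as displayed in the record
head = "ATGAGTGTAA TGGCTTTATC GCAAATCCGT"
print(translate(head))  # MSVMALSQIR
```

Passing the full gene sequence to translate() and gc_content() would allow comparison with the listed protein sequence and the 48% GC content field.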