Gene Cphamn1_1603 details


Gene Information

Locus tag: Cphamn1_1603
Symbol:
ID: 6375281
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 1729709
End bp: 1730827
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 48%
IMG OID: 642684091
Product: NADH dehydrogenase (quinone)
Protein accession: YP_001960005
Protein GI: 189500535
COG category: [C] Energy production and conversion
COG ID: [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H)
TIGRFAM ID:
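
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1729709
end_bp = 1730827

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be the stop codon,
# which contributes no amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1119 bp
print(protein_length, "aa")   # 372 aa
```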


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTACCG GAGCTTTTCT GCCAAACAGT TTTCCCATTG TTATGAGTAC AAGCCTCAAC 
GCCTGGTCTG ATGCTCTCTC TGAGTATTTC CTTTTTGGAC TGCCTGTCGG CCTTGTCATT
TTAGCTGCGC TTCCTCTCGT ATTTATTGCG CTCTACGCGT TGACATACGG TGTTTACGGA
GAGAGAAAGA TTTCCGCGTT CATGCAGGAC CGGCTTGGCC CGATGGAGGT CGGTAAATGG
GGGATTCTTC AGACTCTTGC CGATATTCTG AAACTGCTTC AGAAGGAGGA TATTGTCAAC
AAGTCAGCTG ACAAGTTTCT TTTTGTTATC GGCCCCGGAG TTCTTTTTGT CGGTTCATTT
CTCGCTTTTG CTGTACTTCC GTTCGGTCCC GCGTTTATCG GTGCCGATCT CAATGTGGGT
CTCTTCTATG CCATAGGAAT TGTCGCCCTC GAAGTTGTCG GTATTCTTGC CGCCGGCTGG
GGATCAAACA ACAAATGGGC TCTTTACGGA GCTATCCGAA GCGTTGCCCA GATAGTCAGT
TATGAGATTC CTGCAGCTAT CGCCATTCTG TGCGGTGTCA TGATGGCAGG GACACTCAGT
ATGCAGCAGT TTAATATTCT GCAGCAGGGC GAGTATGGTT TTCTGCACTT TTTCCTTTTC
CAGAACCCTA TCGCCTGGCT TCCGTTTCTT ATCTACTTTA TCGCGTCCCT TGCCGAGACA
AATCGTGCTC CTTTTGATAT ACCTGAAGCT GAATCCGAGC TTGTTGCCGG TTATTTCACA
GAGTACAGCG GTATGAAATT CGCTGTGATC TTTCTTGCGG AATATGCCAG TATGTTTATG
GTTTCAGCGA TCATTTCAAT TGTTTTTCTG GGAGGCTGGA ATTCACCGTT TCCCAATATC
GGTCCGCTGT TGCTTAATGA CTGGACAACC GGTCCCGTAT GGGGGGCATT CTGGATCATC
ATGAAAGGTT TCTTCTTCAT TTTTATCCAG ATGTGGCTCA GATGGACGCT TCCAAGACTG
AGAGTTGATC AGCTGATGCA TGTCTGCTGG AAAGTGTTGA CCCCGTTTGC TTTTGTGGCA
TTCGTTCTGA CGGCGATATG GGAGATTTAT GTCAAATAG
 
Protein sequence
MSTGAFLPNS FPIVMSTSLN AWSDALSEYF LFGLPVGLVI LAALPLVFIA LYALTYGVYG 
ERKISAFMQD RLGPMEVGKW GILQTLADIL KLLQKEDIVN KSADKFLFVI GPGVLFVGSF
LAFAVLPFGP AFIGADLNVG LFYAIGIVAL EVVGILAAGW GSNNKWALYG AIRSVAQIVS
YEIPAAIAIL CGVMMAGTLS MQQFNILQQG EYGFLHFFLF QNPIAWLPFL IYFIASLAET
NRAPFDIPEA ESELVAGYFT EYSGMKFAVI FLAEYASMFM VSAIISIVFL GGWNSPFPNI
GPLLLNDWTT GPVWGAFWII MKGFFFIFIQ MWLRWTLPRL RVDQLMHVCW KVLTPFAFVA
FVLTAIWEIY VK
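
The relationship between the gene sequence and the protein sequence can be sketched as below. This is a minimal illustration, not the tool that produced this record. It assumes the sequence is pasted in the 10-base blocks shown above (whitespace is stripped) and that it is read in frame from its first base. Translation table 11 assigns the same amino acid to each codon as the standard genetic code and differs in which start codons it permits, so a standard codon table is used here; an alternative start codon would not be converted to M by this sketch. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGAGTACCG GAGCTTTTCT GCCAAACAGT"))  # MSTGAFLPNS
```

Passing the full gene sequence to `translate` and `gc_content` should allow the Protein sequence and GC content fields to be compared against the record.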