Gene Cag_0643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0643 
Symbol 
ID3747320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp915019 
End bp916653 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content41% 
IMG OID637773179 
Productproton-translocating NADH-quinone oxidoreductase, chain M 
Protein accessionYP_378959 
Protein GI78188621 
COG category[C] Energy production and conversion 
COG ID[COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) 
TIGRFAM ID[TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.552071 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGAGTC TTATTGTTTT TCTCCCTCTT ATTGCCGGTC TGATTATTCT TGCCGTGCCA 
GCCTCGCAAA AGCAGGTGAT TAGGATAGTC TCGTTGCTTG CAGCATTGGT GCAAATGGTT
CTTGCTGTTA TGATTTGGCG CGATTATGAT CCTTCATTAG CGGGTATTAC TGCTGGTGCT
GGAGGCACGC TTGCGGGATC ATTTCAATTT GTAGAGCGTC TCCCATGGAT TAGTTTAGAT
CTTGGCTCGT TTGGTCCATT AACCATTGAG TATTTTCTTG GGGTTGATGG TCTTTCGATT
ACGATGATTA TTTTAACGGC ATTAGTTTCA GCTATTGGTG TTCTTTCGAG TTGGACTATT
CAAAAGCAAG TCAAAGGATA TTTTATTCTC TATAATATTC TTGCAACGGC AATGATGGGC
TGTTTTGTAG CTCTTGATTT CTTTCTCTTT TATGTATTCT GGGAAGTGAT GTTGTTGCCG
ATGTACTTCC TTATTGGTAT TTGGGGCGGA CCTAATCGTG AATATGCGGC TATCAAATTC
TTCCTCTATA CCTTGTTTGG TTCGGTATTT ATGTTGTTAG TGATGATTGG CCTTTACTTT
AGTGTTATTG ATCCACTTAC TGGTAACCAT ACGTTTAGTC TTGTTGCAAT GGCAAGCCAA
GAGAATTATG TAAAAGGTGC TATTCTTGGT CCCGATAGTG TTTTCTGGCG TTATGCAGCT
TTCATTGTGC TTTTTGTTGG TTTTGCTATT AAAGTTCCAA TGTTTCCATT CCATACGTGG
TTACCTGATG CACACGTTGA AGCGCCAACC CCTATTTCAG TTATTCTTGC TGGTGTGTTG
CTGAAACTTG GTACTTATGG AATGATGCGT ATTAATTTTC CTCTCTTTCC TGAGGTGTTT
CAAGCATCGC TTTATGTGAT AGGTATTTTT GGTGCTATTA ATATTATTTA TGGCGCATTC
TGTGCATTAG CTCAAAAAGA CTTAAAAAAG ATGGTGGCTT ATTCATCCAT TAGTCACATG
GGTTATGTAT TGCTTGGGCT TGCCGCTGGT AATAGCGAGG GAATGCTTGG TGCGCTTTAC
CAAATGTTTA ACCATGGCAC CATCACCGCA ATGCTCTTTT TATTGGTAGG TGTTATTTAT
GATCGTGCGC ACTCTCGCCA AATTGAGAAG TTTGGTGGAC TTGCTACCTA TATGCCAGTT
TATGCTGCCT TTGTAACAGT TGCATGGTTT GCTTCACTTG GCTTACCAGG GCTGAGTGGT
TTTATTTCAG AAGCTTTTGT GTTTGTTGGG GCTTTTAGTG CCGAAGTTAC TCGTCCTATT
GCAATTGTTT CTGTGCTTGG TATTGTGTTT GGTGCAGCCT ACTTACTTTG GTCGTTACAA
AGAATGTTCC TTGGTCAAAG AAGAGCTGAT GCACTCTATG ATGTTGTAGA GGATGAGCAT
GGACATAAGC ACATTCATTT TCATGATTGG AATGGTAAGC TGGATCTTGA TGCGCGTGAA
TTAACAATGC TTGTTCCGCT TGTTATTATC ACCATTTTCC TTGGTGTTTA TCCAATGCCA
ATCATGGGTT TATTGACTTC AAGCATCAAT AAACTTGTGC AAGTGCTTTC TCCTGTTGTT
TTATCGCAGA TGTAA
 
Protein sequence
MLSLIVFLPL IAGLIILAVP ASQKQVIRIV SLLAALVQMV LAVMIWRDYD PSLAGITAGA 
GGTLAGSFQF VERLPWISLD LGSFGPLTIE YFLGVDGLSI TMIILTALVS AIGVLSSWTI
QKQVKGYFIL YNILATAMMG CFVALDFFLF YVFWEVMLLP MYFLIGIWGG PNREYAAIKF
FLYTLFGSVF MLLVMIGLYF SVIDPLTGNH TFSLVAMASQ ENYVKGAILG PDSVFWRYAA
FIVLFVGFAI KVPMFPFHTW LPDAHVEAPT PISVILAGVL LKLGTYGMMR INFPLFPEVF
QASLYVIGIF GAINIIYGAF CALAQKDLKK MVAYSSISHM GYVLLGLAAG NSEGMLGALY
QMFNHGTITA MLFLLVGVIY DRAHSRQIEK FGGLATYMPV YAAFVTVAWF ASLGLPGLSG
FISEAFVFVG AFSAEVTRPI AIVSVLGIVF GAAYLLWSLQ RMFLGQRRAD ALYDVVEDEH
GHKHIHFHDW NGKLDLDARE LTMLVPLVII TIFLGVYPMP IMGLLTSSIN KLVQVLSPVV
LSQM