Gene Cpha266_0738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0738 
Symbol 
ID4569932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp838633 
End bp839985 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content51% 
IMG OID639765335 
Productnitrogenase 
Protein accessionYP_911216 
Protein GI119356572 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACATG CAAAAACAGC AACACAAAAC GCCTGCAAAC TGTGCAACCC GCTCGGGGCA 
TGCCTTGCTT TCAGGGGAAT CGAAAACTGC GTACCCTTCC TGCACGGTTC ACAAGGTTGC
GCAACCTATA TCCGTCGTTA CCTGATAAGC CATTACAAGG AACCGATCGA TATCGCTTCG
TCGAACTTCA ATGAAGAAAC CGCCGTGTTT GGAGGCAGTC ATAACCTGCA GCTTGGACTG
AAAAACGTCA CCGAGCAGTA CAAGCCTGAG GTTATCGGAC TGGCCACAAC ATGCCTGAGC
GAAACCATCG GGGATGATGT GCCGATGATC CTTCGCGACT ATAAAAAAGC GTTTAAAAAC
GGTACGCCAA TGCCGATAAT GATTCATGCC TCAACGCCAA GCTATCAGGG AAGCCACATC
GACGGCTTTC ATGCCGCTGT CAGGGCAAGC GTTAAAACCC TTGCTGAAAA AGGGGCACGA
AAAAACCTGA TCAATATCTT CCCGAACATG ATCTCGCCGG CGGATATTCG TTACATCAAG
GAGATTCTCT CCGATTTCAG CGTGCCCTAT ATGCTCCTGC CAGACTACTC ACAGACGATG
GACGGCGGGC CATGGGGCGA ATACCACCGC ATCCCGCCAG GAGGGACACC TGCCGGAGCC
ATTGCCGGTG CAGGATGCGC AACGGCAAGT ATCGAGTTCG GCTCTACCCT TGAATCCTCA
AAATCTGCCG CCGGCTACCT TGAGGAGACA TTCGACGTTC CCCGTTACCC TCTTGCCCTG
CCGATCGGAA TAAACGAGAG CGACAGACTG TTCAACCTGC TTGAAAAACT GACCGAACAG
AAAATGCCGG AAAAATATGA GGATGAACGA CGCCGTCTGG TTGACGCTTA TGCCGATGGG
CACAAATATG TTTTTGAGAA AAAGGTCATT CTGTACGGAG AGGAAGACCT GGTGATCTCC
ATGGCAGCGT TTCTGCGTGA AATCGGCATG ACCCCCGTAC TCTGCGCGTC GGGAGGCAAA
AGCGGTCTGA TGAGAAAAAA GCTTCTTGAA CTGATTCCCG ACCTTGAGGA ACAGGGCATC
AGGATACGTG ACGGAGTGGA CTTTGTCGAT ATTGAAGACG AAGCCAAAGT ACTGCTCCCT
GACCTCCTGA TCGGCAATAG TAAAGGCTTT ACGATGGCAC GAAAAAACAA CATTCCGCTG
ATGAGAATCG GCTTTCCGAT TCATGACCGG TTCGGTGGAC AGCGAATGCA TCATATCGGG
TATCGCGGTA CCCAGGAACT CTTTGACCGG ATAGTCAACA CCGTTATCGA ACAACGGCAG
AATGCTTCAT CAATAGGCTA TACCTACATG TAA
 
Protein sequence
MKHAKTATQN ACKLCNPLGA CLAFRGIENC VPFLHGSQGC ATYIRRYLIS HYKEPIDIAS 
SNFNEETAVF GGSHNLQLGL KNVTEQYKPE VIGLATTCLS ETIGDDVPMI LRDYKKAFKN
GTPMPIMIHA STPSYQGSHI DGFHAAVRAS VKTLAEKGAR KNLINIFPNM ISPADIRYIK
EILSDFSVPY MLLPDYSQTM DGGPWGEYHR IPPGGTPAGA IAGAGCATAS IEFGSTLESS
KSAAGYLEET FDVPRYPLAL PIGINESDRL FNLLEKLTEQ KMPEKYEDER RRLVDAYADG
HKYVFEKKVI LYGEEDLVIS MAAFLREIGM TPVLCASGGK SGLMRKKLLE LIPDLEEQGI
RIRDGVDFVD IEDEAKVLLP DLLIGNSKGF TMARKNNIPL MRIGFPIHDR FGGQRMHHIG
YRGTQELFDR IVNTVIEQRQ NASSIGYTYM