Gene Cphamn1_1757 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1757 
Symbol 
ID6375444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1899986 
End bp1901359 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content49% 
IMG OID642684250 
ProductNitrogenase 
Protein accessionYP_001960156 
Protein GI189500686 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0799606 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAA CAGCAAAAGC AGCGACTCAG AACGCATGTA AACTCTGTAA TCCACTGGGA 
GCCTGCCTCG CGTTCAGAGG CATAGAAAAG TGCGTTCCCT TTCTTCACGG ATCACAGGGA
TGCGCAACCT ATATTCGACG ATACCTTATC AGCCATTTCA AGGAACCGGT TGATATCGCG
TCATCAAACT TCAATGAAGA TACGGCTGTA TTCGGCGGCA GCCACAACCT GCAACTGGGG
TTGAAAAATG TCACGCTCCA GTATAAACCT GAGGTCATCG GGCTGGCGAC GACCTGCCTG
TCGGAAACAA TCGGGGACGA TGTCGACATG ATCCTCCGCG ACTATGACAA ACTTTTTGAA
AACGGAGAAC CGTTACCCAA CGGAAAACCG CTCCCATTGA TGATCCATGC GTCAACCCCC
AGTTATCAGG GCAGCCACAT CGACGGGTTT CATGCCGCGG TTAAAGCGAC GGTTGAAACA
ATTGCTGAAA GCGGACAAAA AGAGAATCTT CTGAACCTCT ATCCCAACAT GGTTTCTCCC
GCGGACCTCA GACACATGAA GGAGATCCTC AAAGACTTCA ACATTCCCTA CGTCCTGCTG
CCTGACTATT CGGAGACTCT TGACGGAGGA CCGTGGGATG AATATCACAG AATTCCGAAA
GGCGGCACAA CGGTCAGCGC GATCAGAAAA AGCGGCAAGG CCGCTGCAAG TCTGGAATTC
TCATCGGTAC TGACCGCAGA CAAGTCAGCT GCCGTATATC TGGAAAAGAA GTTCGATGTA
CCTGCATATT CCATGACGTT GCCGATCGGC ATCAAACAGA GCGACGCGTT TTTCGGACTG
CTCGAAAAGC TCTCAGAGAC TCCTATGCCT GAAAAATATG AAGATGAGCG GAGAAGACTT
GTCGATGCTT ATGCAGACGG GCACAAGTAC ATTTTCGAGA AAAAAGCGAT TGTGTACGGT
GAAGAGGATC TGGTGATCGC CATGACTGCG TTTCTGACAG AGATCGGCAT CACTCCTGTA
CTGTGCGCTT CCGGAGGAAA AAGCGGTCAC CTGAAAAAAC GGATTGAAGA GATCGTTCCC
GACAGTGAAA ATACCGGCAT ACTCGTCCGT GATGGTGTTG ATTTTGTTGA TATCGAGGAT
GAGGCGAAAG TCCTGAAGCC CGATCTTCTC ATCGGCAACA GTAAAGGCTA CACCATGTCA
AGGAAAAACA ACACTCCCAT CATCAGGATA GGATTTCCTA TCCATGACCG GTTCGGAGGA
CAGCGTCAAC TTCATCTCGG TTATCGCGGG ACACAGGAAC TGTTCGACAG AATCGTCAAT
ACCATTCTTC AAGAGAGACA GAATTCATCA CCAATCGGAT ATACATACCA GTAA
 
Protein sequence
MKTTAKAATQ NACKLCNPLG ACLAFRGIEK CVPFLHGSQG CATYIRRYLI SHFKEPVDIA 
SSNFNEDTAV FGGSHNLQLG LKNVTLQYKP EVIGLATTCL SETIGDDVDM ILRDYDKLFE
NGEPLPNGKP LPLMIHASTP SYQGSHIDGF HAAVKATVET IAESGQKENL LNLYPNMVSP
ADLRHMKEIL KDFNIPYVLL PDYSETLDGG PWDEYHRIPK GGTTVSAIRK SGKAAASLEF
SSVLTADKSA AVYLEKKFDV PAYSMTLPIG IKQSDAFFGL LEKLSETPMP EKYEDERRRL
VDAYADGHKY IFEKKAIVYG EEDLVIAMTA FLTEIGITPV LCASGGKSGH LKKRIEEIVP
DSENTGILVR DGVDFVDIED EAKVLKPDLL IGNSKGYTMS RKNNTPIIRI GFPIHDRFGG
QRQLHLGYRG TQELFDRIVN TILQERQNSS PIGYTYQ