Gene Cphamn1_1754 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1754 
Symbol 
ID6375441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1895417 
End bp1897039 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content49% 
IMG OID642684247 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_001960153 
Protein GI189500683 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTCAA GAAAATACCC TGAACCTTCC GTAGTCAGGG AGGAGTTGAT AAAGAAATAT 
CCGCCCAAGG TTGCCAAAAA AAGAGGCAAA GCGATTGTCA TCAACGATCC TGAAACGATC
CCGCCGGTAC AGGCCAATAT CAGGACCATA CCCGGGATTA TCTCACAGAG GGGATGTTCT
TACGCAGGAT GTAAAGGGGT TGTTCTGGGC CCGACAAGAG ACATTGTCAA TCTGGTGCAC
GGACCTATCG GATGCAGTTT TTACGCCTGG CTGACCCGAA GAAACCAGAC GCGTCCTGAC
GGACCGGATG ATAAAAACTA TATGACCTAC TGCTTCTCAA CCGACATGCA GGAAGAACAT
GTTGTTTTCG GTGGAGAGAA AAAGCTGAAA GAGGCAATCC AGGAAGCCTA CGACATATTC
CGTCCAAAAG CTATCGGCAT TTTCTCAACC TGTCCGGTAG GCCTTATCGG AGATGACGTT
CACGCAGTAG CCAGAGAGAT GAAAGAAAAA CTTGGCGACT GCAATATTTT CGGCTTCAGC
TGTGAAGGCT ACAAAGGTGT CAGCCAGTCG GCCGGACACC ATATCGCCAA TAACCAGGTC
TTCAAACATG TTGTCGGCCT TGATGACACC GACAAGGGCG GAAAGTTCAA GATCAACATG
CTTGGTGAAT ACAATATCGG AGGTGACGCT TTCGAGATCG AGCGGCTGCT TGAAAAATGC
GGCATAACAA TGGTCGCGAG CTTCAGCGGC AACTCGACGG TCAACCAGTT TGAAAACTCT
CACACCGCCG ATCTGAACGT GATCATGTGC CACCGCTCGA TCAACTATAT GGCCGAGATG
ATGGAAACGA AATATGGCAT TCCCTGGATG AAAGTCAACT TCATCGGCGC GGAATCATCC
GCAAAATCAC TCCGCAGAAT CGCCAGGTAT TTTGAAGATG AAGAACTGAT GGCAAAGGTC
GAGCAGGTCA TAGCCGAGGA ACTGCCGGTC GTCCAGTCGG TGATCAACGA GATCTACCCG
AGAACAAAAG GTAAGCTCGC TATGCTCTTC GTCGGCGGGT CCAGGGCTCA CCACTATCAG
GAGCTGTTTG GTGAACTGGG TATGGAAACC ATCTCGGCAG GTTACGAGTT CGGACATCGG
GACGACTATG AAGGGCGGAA GGTCATTCCG AATATCAAAG TAGATGCTGA CAGCAAGAAC
ATCGAGGAGC TCAAGGTTAC CGCTGATCCG GAAAAATTCA AACCGAGAAA AACGGAAGAA
GAACTTGAAA AGCTCAAAGC TGAAGGCCTT GACATCAGGG AATACGAAGG CATGATGCCC
GATATGAAAA AAGGATCTAT CGTCATCGAT GATATCAGCC ATTACGAAAG CGAAAAGCTG
ATTGAACTCT ACAAACCTGA TATTTTCTGC GCCGGCATCA AGGAAAAATA TGTCGTGCAG
AAAATGGGCG TCCCCTTGAA ACAGCTTCAC AGCTATGACT ACGGGGGGCC TTATGCCGGA
TTTAAAGGCG CGATTAACTT TTACAGGGAT ATCGACAGAA TGGTCAACAG CCGGGTCTGG
AAACTTATCC AGGCGCCTTG GGAAGAAACG ACTGAGCTGG AAGCTAACTA TGTCACACAG
TAA
 
Protein sequence
MESRKYPEPS VVREELIKKY PPKVAKKRGK AIVINDPETI PPVQANIRTI PGIISQRGCS 
YAGCKGVVLG PTRDIVNLVH GPIGCSFYAW LTRRNQTRPD GPDDKNYMTY CFSTDMQEEH
VVFGGEKKLK EAIQEAYDIF RPKAIGIFST CPVGLIGDDV HAVAREMKEK LGDCNIFGFS
CEGYKGVSQS AGHHIANNQV FKHVVGLDDT DKGGKFKINM LGEYNIGGDA FEIERLLEKC
GITMVASFSG NSTVNQFENS HTADLNVIMC HRSINYMAEM METKYGIPWM KVNFIGAESS
AKSLRRIARY FEDEELMAKV EQVIAEELPV VQSVINEIYP RTKGKLAMLF VGGSRAHHYQ
ELFGELGMET ISAGYEFGHR DDYEGRKVIP NIKVDADSKN IEELKVTADP EKFKPRKTEE
ELEKLKAEGL DIREYEGMMP DMKKGSIVID DISHYESEKL IELYKPDIFC AGIKEKYVVQ
KMGVPLKQLH SYDYGGPYAG FKGAINFYRD IDRMVNSRVW KLIQAPWEET TELEANYVTQ