Gene Cphamn1_1756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1756 
Symbol 
ID6375443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1898628 
End bp1899989 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content48% 
IMG OID642684249 
Productnitrogenase MoFe cofactor biosynthesis protein NifE 
Protein accessionYP_001960155 
Protein GI189500685 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00502819 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAGC TTGAAATACT CGAGGGAAGA AAACAATCGG TCTTTGAAAA AAACAAAGAT 
GCGGGAGCAT TTGACATTTC CTGTGAAACT ACAAGCCTCT CAGGCTCTGT CAGTCAGAGA
GCATGTGTCT TCTGCGGATC ACGAGTCGTA CTCTACCCTG TTGCTGATGC CCTTCATATT
GTCCATGGCC CGATCGGATG TGCCGCCTAC ACGTGGGACA TCCGGGGGTC GGTATCTTCC
GGACCGGAAC TGCACCGTCT GAGTTTTTCT ACCGACCTGC AGGAAATGGA TGTCATTTAC
GGGGGCGAAA AAAAGCTCTA CAAATCCCTG ATTGAACTGA TTGACCAGTA CACACCGAAA
GCCGCGTTTA TCTATTCCAC CTGCATCATC GGACTGATCG GCGATGATAT CGACGCGGTC
TGCAAAAAAG TCTCGGAAGA GAAAGGAATT CCTGTTCTGC CGGTACATTC GGAAGGATTC
AAGGGCACCA AAAAAGATGG GTACAAAGCT GCCTGTGATG CCCTCATGAA AATCGTTGGC
ACCGGTTCAA CAGAAGGAAT AGGAAAGTAC AGCATCAATA TCATGGGTGA GTTTAACCTT
GCCGGTGAAG CCTGGATTAT CAGAAAATAC TATGAAGAAA TGGGAATAGA GGTTGTCTCT
ACCCAGACCG GCGACGGCAG GGTGGACGAT ATACGCCGAT CCCACGGGGC CGCACTCAAT
ATTGTGCAGT GCTCGGGCTC TATGGTCAAA CTGGCCAAAA TGATGGAGGA AAAATACGGC
ATTCCATACA TGAGAGTATC GTATTTCGGT ATTGAGGATA TGTCAAAAGC GCTTTACGAT
GTCGCGAACC ATTTCAGCGA TAATCCGGCA ATCATGGAAG CCGCTAAAAA TATCGTACGG
CGGGAAGTGG GGGAAATCTA CCCGCAGATC ATGAAGTATC GGGCAGCTCT CAGCGGTAAA
AAAGCCGCCA TCTATGTAGG AGGTGCGTTT AAGGCATTTT CACTGATCAA GGCACTCTCC
TCTGTCGGAA TGGATGTTGT CCTGGCCGGA TCGCAAACAG GGAACAAGGA TGACTATGCG
GGGCTGAAAG AAATGTGCGA TGAAGGAACG GTAATCGTTG ACGATTCCAA TCCGGTTGAA
CTTTCAAAAT TTGTTCTTGA AAAACAGGCA GACCTCCTTA TCGGAGGCGT AAAGGAACGT
CCGATAGCAT ATAAACTCGG TATAGGCTTC TGCGACCACA ATCATGAGCG CAAAATACCT
TTAGCCGGCT TTGTGGGAAT GGTGAACTTC GTCAAGGAGG TATACCAGTC CGTCATGAGC
CCGGTCTGGC AGTTTGCACC GAGAAAAGGA GGAAAATTAT GA
 
Protein sequence
MDKLEILEGR KQSVFEKNKD AGAFDISCET TSLSGSVSQR ACVFCGSRVV LYPVADALHI 
VHGPIGCAAY TWDIRGSVSS GPELHRLSFS TDLQEMDVIY GGEKKLYKSL IELIDQYTPK
AAFIYSTCII GLIGDDIDAV CKKVSEEKGI PVLPVHSEGF KGTKKDGYKA ACDALMKIVG
TGSTEGIGKY SINIMGEFNL AGEAWIIRKY YEEMGIEVVS TQTGDGRVDD IRRSHGAALN
IVQCSGSMVK LAKMMEEKYG IPYMRVSYFG IEDMSKALYD VANHFSDNPA IMEAAKNIVR
REVGEIYPQI MKYRAALSGK KAAIYVGGAF KAFSLIKALS SVGMDVVLAG SQTGNKDDYA
GLKEMCDEGT VIVDDSNPVE LSKFVLEKQA DLLIGGVKER PIAYKLGIGF CDHNHERKIP
LAGFVGMVNF VKEVYQSVMS PVWQFAPRKG GKL