Gene Cphamn1_1758 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1758 
Symbol 
ID6375445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1901371 
End bp1902645 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content51% 
IMG OID642684251 
Productnitrogenase cofactor biosynthesis protein NifB 
Protein accessionYP_001960157 
Protein GI189500687 
COG category[R] General function prediction only 
COG ID[COG0535] Predicted Fe-S oxidoreductases 
TIGRFAM ID[TIGR01290] nitrogenase cofactor biosynthesis protein NifB 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0688077 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACTAT CCATAGAAAA ACATCCCTGT TTTAACGATG CATCGAGACA CAGTTTCGGC 
CGCATTCACC TCCCTGTTGC CCCGAAATGC AATATCCAGT GCAACTACTG CAACAGAAAA
TTCGACTGTC TGAACGAAAA CAGACCGGGA GTGACCAGCA GGGTGCTCTC ACCCCATCAG
GCACTGCACT ATCTTGATCA GGCGCTTGAG CTCTCTCCGA ACATTGCCGT TGTCGGGATT
GCCGGCCCGG GGGATCCGTT CGCAAACCCT GAAGAAACCA TGACAACCCT GAGACTTGTT
CGTGAAAAAT ATCCGGAAAT GCTGCTCTGC GTTGCCAGCA ACGGGTTGAA CGTTCTGCCA
TATATCGAAG AACTTGCAGA ACTCAAGGTC AGCCATGTTA CCCTGACCAT CAATGCTATA
GATCCGGAAA TCGGTGCTGA AATCTATGCC TGGGTACGAC ACGGAAAAAA AGTGTTCCGT
GACGTTGCGG GAGCGGAACT GCTTCTCAAG AACCAGCTTG AAGCACTGAA AAAGCTCAAG
GAACTTGGTG TGACGGCGAA GGTCAACTCC ATTATCATTC CAGGCATCAA CGACAAGCAT
GTTGTCGAGG TTGCAAAAGC CGTATCGGAA CTCGGCGCCG ACATTTTCAA CGGCCTTTCA
TACTACAGGA CCGAAGAAAC CGTTTTCGAA AACATTCCTG AACCACATCC TGAACTGGTT
TTAGCGTTAC AGAAAGAAGC ATCGAACTAT CTGCCGCAGA TGCAACACTG TGCTCGCTGC
AGGGCAGACG CTGTGGGCAT CATCGGAGAG GAAAACAATG ACAGCATCAT GAAAGAACTG
ATAGAAGCAG CGAAGCTGCC GAAGAATCCA TCGGAAAACA GGCCCTTTGT CGCTGTTGCG
AGCATGGAGG GCGTGCTGAT AAACCAGCAT CTCGGAGAAG CCGATCGTTT TCTGATCTAT
GCATTGGATG AGAAGAGCGA GAAACCACTC CTGGTCGAGT CACGTCCGGC ACCGCCTACC
GGAGGAGGAA CCATGCGCTG GGAAGCCGTT TCCTCCATGC TTCTGGACTG CAAAGCGCTG
CTGGTCAACG GGGCTGGAGA GTCACCTAAA AAAGTGCTCT CCGACAGCGG TATCGAAATC
TACGTTCTTG ACGGGCTTAT CGAGGAGGGG GTTTCCGGAG TGTTTTGCGG AAAAGATATG
AGCCGGATGA CACGTATAAG CCAGATGCAT GCCTGCAAAA CAAGCTGTTC AGGAACCGGC
GGAGGATGCG GGTAA
 
Protein sequence
MTLSIEKHPC FNDASRHSFG RIHLPVAPKC NIQCNYCNRK FDCLNENRPG VTSRVLSPHQ 
ALHYLDQALE LSPNIAVVGI AGPGDPFANP EETMTTLRLV REKYPEMLLC VASNGLNVLP
YIEELAELKV SHVTLTINAI DPEIGAEIYA WVRHGKKVFR DVAGAELLLK NQLEALKKLK
ELGVTAKVNS IIIPGINDKH VVEVAKAVSE LGADIFNGLS YYRTEETVFE NIPEPHPELV
LALQKEASNY LPQMQHCARC RADAVGIIGE ENNDSIMKEL IEAAKLPKNP SENRPFVAVA
SMEGVLINQH LGEADRFLIY ALDEKSEKPL LVESRPAPPT GGGTMRWEAV SSMLLDCKAL
LVNGAGESPK KVLSDSGIEI YVLDGLIEEG VSGVFCGKDM SRMTRISQMH ACKTSCSGTG
GGCG