Gene Cag_1251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1251 
Symbol 
ID3748289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1714963 
End bp1716234 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content49% 
IMG OID637773789 
Productnitrogenase cofactor biosynthesis protein NifB 
Protein accessionYP_379555 
Protein GI78189217 
COG category[R] General function prediction only 
COG ID[COG0535] Predicted Fe-S oxidoreductases 
TIGRFAM ID[TIGR01290] nitrogenase cofactor biosynthesis protein NifB 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000233074 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAAG ATATTACCAA ACACCCCTGT TTTAACGATT CAGCTCGCCA CACCTTTGGG 
CGAATTCACC TCCCTGTTGC GCCTAAGTGC AACATTCAGT GCAACTATTG CAGTCGCAAA
TTCGACTGCA TGAACGAGAA CCGTCCGGGA GTTACCAGCA AAGTGCTCTC GCCCCAGCAA
GCACTCTACT ACTTAGATCA AGCAATGGAG CTTTCACCAA ACATTGCTGT TGTAGGCATT
GCAGGACCAG GCGATCCCTT TGCTAACCCC GATGAAACAA TGGAAACCTT GCGCCTTGTT
CGGGCAAAAT ATCCCGAAAT GCTGCTATGC GTTGCCACCA ACGGGCTTGA TTTGCTACCA
TATATTGACG AGCTGGCTCG CTTGCAAGTA AGCCATGTAA CCATCACCAT TAACGCCATT
GATCCTGAAA TTGGTCAAGA AATTTATGCG TGGGTGCGTT ACAACAAAAA AATGTACAGA
GGGAAGGATG CCGCAAAAGT GCTCATCAAC AACCAGCTTG AAGCTCTCAA ACGGTTAAAA
GAGGTGGGCG TTACAGCAAA AGTAAACTCC ATTATTATTC CCGGCATTAA TGATGCGCAC
GTTATTACCG TAGCCTCCAA AGTAGCCGAA TTGGGAGCCG ACATCTTGAA CTGCTTACCC
TACTACAACA CTAAAGAGAC CGTTTTTGAA AATATAGATG AACCATCGCC TGAGCTTGTT
TTTGAAATTC AAAAAGCCAC CAGCGAATTT TTACCCCAAA TGAAACACTG CGCCCGCTGC
CGCGCCGATG CAGTTGGCAT TATTGGTGAA ATCAACTCAC CCGAAATTAT GGAGAAAATG
GCAGAAGTAG CCGCAATGGC GAAAAACCCA TTTGAGCAGC GCCCCTACAT AGCCGTTGCA
AGCATGGAAG GCGTATTGGT AAACCAACAT CTTGGCGAAG CCGACCGCCT TTTAGTGTAT
GGCATAGACG AACAAGGCGA CTGCGTTTTA GTGGATTCAC GCCAAACACC ACCAGCAGGC
GGCGGCAACG AACGGTGGGA AGCCCTTGCC AACTTACTCA GCGACTGCCG CACCGTATTA
GTAAGCGGAA TCGGCAACTC GCCCAAAAAA GTACTCAACA ACAACGGCGT TGAAGTGCTT
GTTATGGAAG GCGTTATAGC CGAAGCCGTT TACGCACTCT TTAACGGACA CGACATGCGC
CACCTTATAA AAACCGAACT TGCCCACGCA TGTGGCACAA ACTGTTCAGG CACAGGCGCC
GGCTGCGGTT AA
 
Protein sequence
MKQDITKHPC FNDSARHTFG RIHLPVAPKC NIQCNYCSRK FDCMNENRPG VTSKVLSPQQ 
ALYYLDQAME LSPNIAVVGI AGPGDPFANP DETMETLRLV RAKYPEMLLC VATNGLDLLP
YIDELARLQV SHVTITINAI DPEIGQEIYA WVRYNKKMYR GKDAAKVLIN NQLEALKRLK
EVGVTAKVNS IIIPGINDAH VITVASKVAE LGADILNCLP YYNTKETVFE NIDEPSPELV
FEIQKATSEF LPQMKHCARC RADAVGIIGE INSPEIMEKM AEVAAMAKNP FEQRPYIAVA
SMEGVLVNQH LGEADRLLVY GIDEQGDCVL VDSRQTPPAG GGNERWEALA NLLSDCRTVL
VSGIGNSPKK VLNNNGVEVL VMEGVIAEAV YALFNGHDMR HLIKTELAHA CGTNCSGTGA
GCG