Gene Cpha266_0741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0741 
Symbol 
ID4569953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp842766 
End bp844406 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content50% 
IMG OID639765337 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_911218 
Protein GI119356574 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGTCGT ACAAAAATTT ACCGGATCCA TCCCTGGTCA GGGAGGATCT GATACAGAAA 
TACCCGACCA AAGTTGCAAA AAAACGCAAC AAGTCAATTG TAATCAATGA CCCGGAGACC
ATTCCGGAGG TGCAGGCAAA CGTACGCACC GTTCCGGGAA TCATTACCCA GCGCGGCTGC
TGCTATGCCG GTTGTAAAGG TGTTGTTCTT GGACCGACGC GTGATATCGT CAACATTGTT
CACGGACCTA TCGGGTGCAG CTTCTATGCC TGGTTGACCC GTCGGAACCA GACGAGACCG
GAAACACCGG AAGCCGAGAA CTATATCACC TACTGTTTCT CAACCGACAT GCAGGAGGAG
AACGTTGTGT TCGGCGGCGA AAAGAAACTG AAGCAGGCCA TACAGGAAGC CTATGATCTC
TTTCATCCGA AGGCTATCGC TATCTTTTCG ACCTGTCCGG TTGGTCTTAT TGGCGACGAT
GTGCATGCCG CATCAAAAGA GATGCGTGAT AAATTCGGCG ACTGTAACGT TTTCGGGTTC
AGTTGTGAAG GGTACCGGGG TGTCAGCCAG TCGGCAGGCC ACCATATTGC CAACAACGGC
GTTTTCAAAC ACATGGTAGG ACGCAACAAC GCAGTCAAAG AGGGAAAGTT CAAATTAAAC
CTGCTTGGTG AATACAATAT TGGCGGTGAT GCGTTTGAAA TCGAGCGCAT ATTCGAAAGA
ACTGGCATCA CGCTTGTGGC ATCATTCAGC GGCAACTCGA CTGTCGGTCA GATTGAAAAT
GCTCATACAG CCGATCTTAA CGTGATTCTC TGTCACCGGT CGATCAACTA CATGGGCGAG
ATGATGGAAA CCAAATACGG TATCCCGTGG ATGAAAGTGA ACTTTGTCGG CGCTGAATCC
ACAGCCAAAT CACTCAGAAA AATTGCTGAA TATTTTGGCG ATGAAGAGCT GAAAGCCCGG
GTTGAAGCGG TAATTGCCGA AGAGATGCCA AAGGTGAAAG CGGTAATTGA TGAAATCAGA
CCAAGAACCG AAGGCAAGAC CGCCATGCTT TTTGTTGGCG GGTCAAGGGC TCACCACTAT
CAGGATCTTT TCAGCGAGCT TGGAATGACA ACGGTAGCAG CAGGGTATGA ATTCGCACAC
CGCGACGACT ATGAAGGGCG CCATGTTCTG CCCGGCATAA AAATCGATGC CGACAGCAAG
AACATCGAGG AGCTTAAAGT CACTGCGGAT CCGGAACTCT ACAATCCGAG AAAAAGTGAA
GCCGAGCTTG AGGCGCTGAA AGAAAAAGGA CTCGAGATCA ACGGTTACGA AGGAATGATG
AAGCAGATGC TGAAAAAAAC GCTCGTTGTT GACGACGTCA GCCACTATGA ATCGGAAAGA
CTGATCGAGA TCTACAAGCC GGATATCTTC TGTGCAGGCA TCAAGGAGAA ATATGTCGTG
CAGAAAATGG GCGTTCCGCT CAAGCAGCTT CACAGCTATG ACTACGGCGG TCCTTACACC
GGTTTTGAAG GCGCACAGAA CTTCTACCGG GATATCGACC GGATGGTGAA CAATCCCGTC
TGGAAGCTCA TCAAGGCCCC GTGGCAGAAA GCGGAAAACG GATCATCGAC AGCATTGGAA
GCGAGTTACG TCACTCACTA A
 
Protein sequence
MQSYKNLPDP SLVREDLIQK YPTKVAKKRN KSIVINDPET IPEVQANVRT VPGIITQRGC 
CYAGCKGVVL GPTRDIVNIV HGPIGCSFYA WLTRRNQTRP ETPEAENYIT YCFSTDMQEE
NVVFGGEKKL KQAIQEAYDL FHPKAIAIFS TCPVGLIGDD VHAASKEMRD KFGDCNVFGF
SCEGYRGVSQ SAGHHIANNG VFKHMVGRNN AVKEGKFKLN LLGEYNIGGD AFEIERIFER
TGITLVASFS GNSTVGQIEN AHTADLNVIL CHRSINYMGE MMETKYGIPW MKVNFVGAES
TAKSLRKIAE YFGDEELKAR VEAVIAEEMP KVKAVIDEIR PRTEGKTAML FVGGSRAHHY
QDLFSELGMT TVAAGYEFAH RDDYEGRHVL PGIKIDADSK NIEELKVTAD PELYNPRKSE
AELEALKEKG LEINGYEGMM KQMLKKTLVV DDVSHYESER LIEIYKPDIF CAGIKEKYVV
QKMGVPLKQL HSYDYGGPYT GFEGAQNFYR DIDRMVNNPV WKLIKAPWQK AENGSSTALE
ASYVTH