Gene Cag_1247 details

Gene Information

Locus tag: Cag_1247
Symbol:
ID: 3748285
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1708852
End bp: 1710486
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 46%
IMG OID: 637773785
Product: nitrogenase molybdenum-iron protein alpha chain
Protein accession: YP_379551
Protein GI: 78189213
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01282] nitrogenase molybdenum-iron protein alpha chain
            [TIGR01862] nitrogenase component I, alpha chain
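
The start, end, and length values in this record agree with each other if the coordinates are read as 1-based and inclusive. The record does not state that convention, so it is an assumption here, as is the reading that the final codon is a stop codon left out of the protein length. A minimal Python sketch of the check, with variable names chosen only for illustration:

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1708852
end_bp = 1710486

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1635 544
```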


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0023858
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGGAAA AACTTATGAC ATCCGACCCA GCGCAAGTGC GGGAGACGTT GATACAAAAA 
TATCCACCAA AGGTGGCTAA AAAGCGAGCA AAGTCCATTG TTATTAATGA CCCTGAAATA
GTACCCGAAG TACAAGCTAA CGTAAGAACG GTACCGGGCA TTATTACACA ACGTGGTTGT
GCGTATGCTG GTTGTAAAGG TGTGGTGCTT GGTCCAACAC GCGACATTGT CAATATAGTA
CACGGTCCAA TTGGATGCAG CTTTTATGCG TGGTTAACCC GCCGTAACCA AACGCGCCCC
GAAAGTCCAG AACATGCCAA CTACATCACC TACTGTTTTT CAACCGATAT GCAGGAAGAA
AACGTGGTGT TTGGTGGTGA GAAAAAACTC AAAGTGGCAA TTCAAGAGGC TTATGACCTC
TTCCACCCAA AATCAATTGC TATTTTTTCA ACCTGCCCTG TAGGTTTAAT TGGTGATGAC
GTTCATGCGG CAGCAAGAGA GATGAAGGAG AAGTTTGGCG ACTGCAACGT CTTTGGTTTT
AGTTGCGAAG GGTATCGCGG CGTTAGTCAA TCAGCAGGAC ACCACGTTGC CAACAACGGT
GTTTTTAAGC ATATGGTTGG TCGCGACAAC ACGGTAAAGC CCGGCAAATT CAAGCTTAAC
CTGCTTGGTG AGTACAACAT TGGCGGCGAC GCTTTTGAGC TTGAGCGCAT TTTCGAGAGA
GTTGGTATTA CGTTAGTTGC CTCGTTTAGT GGCAACTCAA CGGTTGGTGC GTTGGAAAAC
TCACACACCG CCGACCTCAA CATTATTATG TGCCATCGCT CCATTAACTA CATGGGCGAT
ATGATGGAGA CGAAATACGG TATTCCGTGG ATGAAGGTGA ACTTTGTAGG TGCCCAATCA
ACCGCCAAAT CGTTACGCAA AATTGGTGAA TACTTTGGTG ATGAAGAGCT GAAAGCGCGT
ATTGAAGCGG TGATTGCCGA AGAGATGCCA AAAGTGGAAG CCGTTATTAA TGAAATTCGT
CCACGCACCG AAGGCAAAAC TGCCATGCTC TTTGTAGGTG GCTCACGAGC GCACCACTAC
CAAGATCTCT TTACCGAGCT TGGCATGACA ACAATTGCAG CAGGTTACGA ATTTGCTCAC
CGCGACGACT ACGAAGGGCG CGAAGTGCTA CCAAAAATCA AAATTGATGC CGACAGCAAA
AACATTGAAG AGCTGAAGGT TGAAGCCGAT CCCGAGCTTT ACAAACAAAG AAAAAGTGAA
GCTGAGCTTG AAGAGCTAAA GGCAAAAGGA TTAGAGATTA ATGGCTACGA AGGCATGATG
AAGCAGATGA CGAAAAAGTC GCTTGTGGTG GACGATGTAA GCCACTATGA ATCTGAAATG
CTGATTGAAA TGTACAAGCC CGACATTTTC TGTGCTGGTA TTAAAGAGAA ATATGTGGTG
CAAAAAATGG GCGTGCCGCT CAAACAGCTT CATAGCTACG ACTACGGCGG ACCTTACACA
GGTTTTGAAG GCGCACTTAA CTTCTACCGC GATATTGACC GAATGGTAAA CAATCCTGTT
TGGAAGCTTA TTAAAGCTCC ATGGGAAAAA GCTGAAAATG GCGGAGTACT TGAAGCCGCT
TACGTTCAAG GATAA
 
Protein sequence
MEEKLMTSDP AQVRETLIQK YPPKVAKKRA KSIVINDPEI VPEVQANVRT VPGIITQRGC 
AYAGCKGVVL GPTRDIVNIV HGPIGCSFYA WLTRRNQTRP ESPEHANYIT YCFSTDMQEE
NVVFGGEKKL KVAIQEAYDL FHPKSIAIFS TCPVGLIGDD VHAAAREMKE KFGDCNVFGF
SCEGYRGVSQ SAGHHVANNG VFKHMVGRDN TVKPGKFKLN LLGEYNIGGD AFELERIFER
VGITLVASFS GNSTVGALEN SHTADLNIIM CHRSINYMGD MMETKYGIPW MKVNFVGAQS
TAKSLRKIGE YFGDEELKAR IEAVIAEEMP KVEAVINEIR PRTEGKTAML FVGGSRAHHY
QDLFTELGMT TIAAGYEFAH RDDYEGREVL PKIKIDADSK NIEELKVEAD PELYKQRKSE
AELEELKAKG LEINGYEGMM KQMTKKSLVV DDVSHYESEM LIEMYKPDIF CAGIKEKYVV
QKMGVPLKQL HSYDYGGPYT GFEGALNFYR DIDRMVNNPV WKLIKAPWEK AENGGVLEAA
YVQG
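
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the database's own pipeline. It assumes translation table 11 assigns amino acids to codons exactly as the standard code does (the tables differ in permitted start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments as in the standard code,
# which translation table 11 is assumed to share.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 15 bases of the gene sequence above.
first_block = "ATGGAGGAAA AACTTATGAC ATCCGACCCA"
last_block = "TACGTTCAAG GATAA"

print(translate(first_block))  # MEEKLMTSDP
print(translate(last_block))   # YVQG
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values to compare against the Protein sequence and GC content fields.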