Gene Clim_0682 details


Gene Information

Locus tag: Clim_0682
Symbol:
ID: 6354296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 755017
End bp: 756657
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 53%
IMG OID: 642668309
Product: nitrogenase molybdenum-iron protein alpha chain
Protein accession: YP_001942744
Protein GI: 189346215
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01282] nitrogenase molybdenum-iron protein alpha chain; [TIGR01862] nitrogenase component I, alpha chain
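
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 755017
end_bp = 756657

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
codon_count = gene_length // 3

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codon_count - 1

print(gene_length, "bp")     # 1641 bp
print(protein_length, "aa")  # 546 aa
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.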


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGGCGA ACATACATTT CCCAGATCCT TCCAAAGTCA GGGAGGAGCT GACACAAAGA 
TATCCGGCGA AGGTTGCCAA AAAACGCACG AAGTCGATTA TCATCAACGA CCCGGAAACC
ATACCGGAAG TACAGGCCAA CGTCCGTACC GTGCCTGGTA TCATTACGCA GCGCGGCTGC
TCCTATGCAG GCTGTAAAGG TGTTGTGCTC GGCCCGACAC GCGACATCGT CAACATCGTT
CACGGACCTA TCGGCTGCAG TTTCTATGCA TGGCTGACCC GCCGTAACCA GACAAGACCG
GAAACTCCGG AACATGAAAA CTATATCACC TACTGTTTCT CGACCGATAT GCAGGAAGAG
AATGTGGTGT TCGGCGGCGA AAAGAAGCTG AAACAGGCGA TCCAGGAGGC ATACGATCTT
TTTCATCCGA AATCCATCGC GATCTTCTCC ACCTGCCCGG TAGGTCTGAT CGGCGATGAC
GTGCATGCCG CATCAAAAGA GATGCGCGAA AAGTTCGGCG ACTGCAACGT CTTCGGATTC
AGCTGCGAAG GCTACCGCGG CGTCAGTCAG TCCGCAGGCC ACCATATAGC GAACAACGGT
GTATTCAAAC ACATGGTAGG ACGCAACAAC ACCCCGTCGG AAGGCAAATT CAAGCTGAAC
CTGCTCGGTG AGTACAACAT CGGCGGCGAC GCTTTTGAAA TCGAGCGCAT CTTCGAAAAA
GCGGGAATAA CTCTTGTGGC CTCCTTCAGC GGCAACTCGA CCGTCGGCCA GCTTGAAAAC
GCCCATACAG CCGATCTCAA CGTGATCATG TGTCACCGTT CGATCAACTA CATGGGCGAG
ATGATGGAAA CCAAATATGG CATCCCATGG ATGAAGGTGA ACTTCGTCGG TGCTGAATCG
ACCGCAAAGT CGCTGCGCAA GATCGCGGAA TATTTTGGAG ACGAAGAGCT GAAAGCCAGG
GTCGAAGAGG TCATTGCCGA AGAGATGCCG AAAGTGAAAG CCGTGATCGA CGATATCCGT
CCGAGAACTG AAGGAAAAAC CGCCATGCTC TTTGTCGGCG GATCGCGCGC CCATCACTAC
CAGGATCTCT TTACCGAGCT TGGCATGACT ACGGTAGCTG CCGGTTATGA ATTCGCTCAC
CGCGACGACT ATGAAGGGCG TCATGTGCTG CCGAGCATCA AGGTCGATGC CGACAGCAAG
AACATCGAAG AACTGAAAAT CGTAGCCGAC CCGGAACTCT ATCAGCCGAG AAAAACCGAA
GCCGAACTTG AAGCGCTCAA GGAGAAAGGA CTGGAGATCA ACGGCTATGA GGGCATGATG
AAGCAGATGA TGAAAAAATC GCTCGTCGTC GATGACGTCA GCCACTACGA ATCGGAAAAG
CTGATCGAAA TCTACAAGCC CGATATTTTC TGTGCCGGCA TCAAGGAGAA ATATGTAGTG
CAGAAAATGG GTGTGCCTCT CAAACAGCTG CACAGCTATG ATTACGGCGG TCCTTACACC
GGCTTCGTCG GCGCGACAAA CTTCTACAGG GATATCGACC GTATGGTCAA CAATCCGGTC
TGGAAGCTGA TCAAGGCCCC CTGGGAAACA GCAGGAAACG GAAAAGGCGC AGAACTCGAA
GCCACCTACG TCACACAGTA A
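
The GC content reported in the gene information can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: the helper names `clean_sequence` and `gc_content` are hypothetical, and the code assumes the sequence is pasted in the space-separated blocks of ten bases shown above.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the full gene sequence between the triple quotes.
# Only the first line of the record is shown here as a sample.
sample = """
ATGGAGGCGA ACATACATTT CCCAGATCCT TCCAAAGTCA GGGAGGAGCT GACACAAAGA
"""
print(f"{gc_content(sample):.1f}% GC over {len(clean_sequence(sample))} bp")
```

Run on the complete sequence, the result can be compared with the GC content field, which is given as a rounded percentage.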
 
Protein sequence
MEANIHFPDP SKVREELTQR YPAKVAKKRT KSIIINDPET IPEVQANVRT VPGIITQRGC 
SYAGCKGVVL GPTRDIVNIV HGPIGCSFYA WLTRRNQTRP ETPEHENYIT YCFSTDMQEE
NVVFGGEKKL KQAIQEAYDL FHPKSIAIFS TCPVGLIGDD VHAASKEMRE KFGDCNVFGF
SCEGYRGVSQ SAGHHIANNG VFKHMVGRNN TPSEGKFKLN LLGEYNIGGD AFEIERIFEK
AGITLVASFS GNSTVGQLEN AHTADLNVIM CHRSINYMGE MMETKYGIPW MKVNFVGAES
TAKSLRKIAE YFGDEELKAR VEEVIAEEMP KVKAVIDDIR PRTEGKTAML FVGGSRAHHY
QDLFTELGMT TVAAGYEFAH RDDYEGRHVL PSIKVDADSK NIEELKIVAD PELYQPRKTE
AELEALKEKG LEINGYEGMM KQMMKKSLVV DDVSHYESEK LIEIYKPDIF CAGIKEKYVV
QKMGVPLKQL HSYDYGGPYT GFVGATNFYR DIDRMVNNPV WKLIKAPWET AGNGKGAELE
ATYVTQ
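
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table; the names `CODON_TABLE` and `translate` are hypothetical. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard table (the two differ in permitted start codons, which this sketch does not model), and it assumes the sequence starts in frame.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(raw.split()).upper()  # drop block separators and newlines
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence in this record.
print(translate("ATGGAGGCGA ACATACATTT CCCAGATCCT TCCAAAGTCA GGGAGGAGCT GACACAAAGA"))
# MEANIHFPDPSKVREELTQR
print(translate("GCCACCTACG TCACACAGTA A"))
# ATYVTQ
```

The two fragments reproduce the first twenty and last six residues of the protein sequence above; the second fragment ends at the TAA stop codon.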