Gene Clim_0680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0680 
Symbol 
ID6354294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp752139 
End bp753506 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content52% 
IMG OID642668307 
Productnitrogenase MoFe cofactor biosynthesis protein NifE 
Protein accessionYP_001942742 
Protein GI189346213 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0162365 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAAG AAATCGGGAT ACTCGAAGGA AGACAGGGCC AGGTCTTCGA AAAGAAAAGC 
GGCGAGGCAG AACAGCTCGA TATCTCTTGT GAGAAAACAA GCCTGTCCGG ATCGGTCAGT
CAGCGAGCCT GTGTGTTCTG CGGTTCCCGT GTGGTGCTCT ACCCTGTAGC CGATGCCCTT
CACCTCGTTC ACGGCCCTAT CGGATGCGCA GCCTATACCT GGGACATCCG CGGCGCGGTA
TCTTCAGGCC CGGAACTGCA CCGGTTGAGT TTCTCGACCG ACCTCGGAGA GATGGATGTG
ATCTACGGCG GTGAAAAGAA ACTCTATCTT TCACTTATCG AACTGATCGA CAAGTATAAG
CCCAAAGCGG CATTTATCTA CTCGACATGC ATTATCGGCC TTATCGGTGA CGATATCGAC
GCCGTGTGCA AAAAAGTGTC GAAAGAGACC GGCATTCCAG TTCTGCCGGT TCATTCCGAA
GGGTTCAAGG GAACCAAGAA AGACGGGTAT AAAGCTGCCT GCACCTCTCT CATGAAGCTG
GTAGGCACCG GCTCGATCGA AGGGATCAGT CCTTACAGCA TCAATATTCT CGGCGAATTC
AACCTTGCCG GCGAAGCATG GATCATCAGG GAATACTACG AAAAAATGGG CATCGAGGTT
GTTTCCACCA TGACCGGTGA CGGACGTGTC GACGCCGTAC GCCGTGCTCA CGGCGCTACG
CTCAACGTCG TGCAATGTTC CGGATCAATG ACCACACTTG CCAAAGAGAT GGAGGAAAAA
TACGGCATTC CCTATATGCG CGTCTCCTAC TTCGGCATCG AGGACATGTC CAAATCGCTC
TACGATGTCG CCAAACATTT CAGCGACCGG CCCGACATCA TGGATGCGGC AAAAGAGATT
GTCAGCAAAG AGGTAGCGAA ACTCTACCCC GAACTGCAAA AATTCAAAAA AGTCCTGGCG
GGCAAAAAAG CGGCCATATA TGTCGGTGGA GCATTCAAAA CCTTTTCGCT CATCAAGGCC
CTGCGTTCGA TCGGCATGTC GGTTGTCCTT GCCGGATCCC AGACAGGCAA CAAGGATGAT
TACGAGCGCC TCAGGGAGAT GTGCGACGAA GGAACCATCA TCGTTGACGA CTCGAATCCC
GTCGAACTCT CGAAATTCGT GCTTGAAAAA GAGGCCGACC TGCTTATCGG TGGGGTGAAG
GAGCGGCCGA TCGCCTACAA ACTCGGTATC GGCTTCTGCG ATCACAACCA CGAGAGAAAA
ATTCCTCTGG CAGGATTTAT CGGCATGTAC AACTTCGCAA AGGAGGTCTA TCAGTCGGTC
ATGAGCCCGG TATGGCAGTT CGCTCCGAGA AAAGGAGGCA AAATATGA
 
Protein sequence
MKEEIGILEG RQGQVFEKKS GEAEQLDISC EKTSLSGSVS QRACVFCGSR VVLYPVADAL 
HLVHGPIGCA AYTWDIRGAV SSGPELHRLS FSTDLGEMDV IYGGEKKLYL SLIELIDKYK
PKAAFIYSTC IIGLIGDDID AVCKKVSKET GIPVLPVHSE GFKGTKKDGY KAACTSLMKL
VGTGSIEGIS PYSINILGEF NLAGEAWIIR EYYEKMGIEV VSTMTGDGRV DAVRRAHGAT
LNVVQCSGSM TTLAKEMEEK YGIPYMRVSY FGIEDMSKSL YDVAKHFSDR PDIMDAAKEI
VSKEVAKLYP ELQKFKKVLA GKKAAIYVGG AFKTFSLIKA LRSIGMSVVL AGSQTGNKDD
YERLREMCDE GTIIVDDSNP VELSKFVLEK EADLLIGGVK ERPIAYKLGI GFCDHNHERK
IPLAGFIGMY NFAKEVYQSV MSPVWQFAPR KGGKI