Gene Clim_2132 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_2132 
Symbol 
ID6355926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2350992 
End bp2352035 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content53% 
IMG OID642669723 
ProductTIM-barrel protein, nifR3 family 
Protein accessionYP_001944135 
Protein GI189347606 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATAG GTTCAATAGA CATAGATCGT CCCGTTATTC TCGCTCCGAT GGAGGATGTG 
ACCGACCGTG CTTTCCGACA GCTCTGCAAG CGCTGCGGTG CGGATATTGT CTATACCGAG
TTCATCAGCG CCGAGGCGTT GCGCCGAGGA GTGGAAAAGA CCATGCGCAA ACTTGCCGTC
GATGAGATCG AACGCCCTGT TGCCGTGCAG ATTTTCGGCA GTACGGTTGA GTCTATGATC
GAGGCTGCGG CTGTTGCCGA AACCTACAAT CCGGATTTTC TCGATATTAA TTTCGGCTGT
CCGACCAAAA AAGTGGCAGG CAAGGGGGCA GGAGCCGCTC TGCTCAGGGA GCCTGAAAAA
ATGCAGGCCA TAGCAGAAGC TGTAGTCAAA AGCGTCGGTA TACCGGTTAC GGCTAAAACC
CGGATCGGCT GGGATCACGA TTCCATCAAT ATTATTGAGG TGCTTCACCG GCTTGAAGAT
GCCGGTATCC GGGCGCTTGC CATCCATGGC CGCACCCGGA GCGACATGTA TAAAGGCAGG
GCAGATCGGG AGCGGATTGC CGAGGCGAAG CGGGAGGCAT CGATACCGGT GATTGCCAAT
GGTGATATCT GGTCGGCGGA GGATGCCCTT GCCATGTTCG ATCAAACCGG TGCCGACGGG
GTCATGATCG GCAGAGGGTC GATCGGTAAT CCATTTATAT TCAGTCAGGC AAAGCACCTG
ATCCGGACAG GAACACCGGC TCCTCCGCCG GGATTTCGGG AGAGGATACA TGCTGCGATA
GAACATCTCA AACTCTCTGT TCAGTATAAA GGCGAGAAAT ACGGTAATCT CGAAATGCGC
AGGCACTATG CCACCTATCT TAAAGGCTTG CCGAAAGTGT CGCAAGTGCG CAATAAACTC
GTTCGGGAAG ATGGGTGGGA GCATATTGTC GAAATTCTTC TCGCCTATGA AGTCGAATGT
GAAGGGTATG CGAAGGAGGG ACGCATCAAA GAGTATGCGG AGTACCTCAA CGACCATTCA
GGAAAACTTG TTTTGAATTA TTAA
 
Protein sequence
MKIGSIDIDR PVILAPMEDV TDRAFRQLCK RCGADIVYTE FISAEALRRG VEKTMRKLAV 
DEIERPVAVQ IFGSTVESMI EAAAVAETYN PDFLDINFGC PTKKVAGKGA GAALLREPEK
MQAIAEAVVK SVGIPVTAKT RIGWDHDSIN IIEVLHRLED AGIRALAIHG RTRSDMYKGR
ADRERIAEAK REASIPVIAN GDIWSAEDAL AMFDQTGADG VMIGRGSIGN PFIFSQAKHL
IRTGTPAPPP GFRERIHAAI EHLKLSVQYK GEKYGNLEMR RHYATYLKGL PKVSQVRNKL
VREDGWEHIV EILLAYEVEC EGYAKEGRIK EYAEYLNDHS GKLVLNY