Gene Clim_1707 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1707 
SymbolpurT 
ID6353769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1877842 
End bp1879041 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content57% 
IMG OID642669312 
Productphosphoribosylglycinamide formyltransferase 2 
Protein accessionYP_001943728 
Protein GI189347199 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0027] Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase) 
TIGRFAM ID[TIGR01142] phosphoribosylglycinamide formyltransferase 2 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0749454 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAAA CAATCATGCT GCTCGGCAGC GGAGAACTGG GCAGGGAGTT CGTTATTGCG 
GCAAAACGTC TTGGGCAATA CGTGATTGCT GTCGACAGCT ATAACAATGC GCCGGCGCAG
CAGGTTGCCG ACGAGCGCGA AGTGATCGAC ATGCTGGACG GCAATGCTCT CGATGCTTTG
GTGGCCAGGC ACCGACCCGA TATGATCGTG CCTGAAATCG AGGCCATCCG CACCGAACGA
TTCTACGACT ATGAGGAGCA GGGAATACAG GTGGTGCCTT CGGCACGTGC CGCGAATTTT
ACGATGAATC GGAAGGCCAT TCGTGATCTC GCTTCAAAGG AGCTTGGCCT TCGTACTGCC
AGATACCGAT ACGCGGCTTC TCTCGAAGAA CTGCGGACTT CCGTTTCGGA GGTGGGAATT
CCCTGCGTGG TGAAACCGCT GATGAGCTCG TCGGGCAAGG GGCAGTCAAC GGTTAAAACA
GAAGAGGATA TTGAACGCGC ATGGAGCTAT TCGCAGAGCG GTCGGCGCGG GGATATTGCC
GAAGTGATCG TGGAGGCTTT TGTGCCGTTT CATACCGAGA TCACCCTGTT GACCGTAACG
CAGAAAAACG GCCCGACGCT GTTCTGCCCG CCCATAGGGC ACCGTCAGGA GCGGGGCGAT
TATCAGGAGA GCTGGCAGCC CTGCCGAATC GCGGATGCGC AGTTGCATGA GGCTCGGGAG
ATCGCTGAAA ACGTAACTCA TTCGCTGACA GGCGCGGGTA TCTGGGGTGT GGAGTTTTTC
CTTGCCGATG ACGGGCTCTA TTTTTCGGAA CTCTCGCCCC GTCCGCACGA TACCGGCATG
GTGACGCTGG CTGGTACGCA GAATCTCACG GAGTTCGAGC TTCATGCCCG TGCTGTGCTC
GGGCTTCCGA TTCCGGAAAT CGAATTGCTG CGGGTGGGCG CAAGTGCGGT TGTTCTTGCC
GGCAGCGAGG GGGAGAACCC CGTCTATACC GGTCTGGAGG ATGCCCTCAG GCAGGCCGGT
ACCGACATCC GCATTTTCGG AAAACCGACA TCACGCCCAT ACAGGCGAAT GGCCGTGACT
CTGGCTTACG ACCGGCCGGG AAGCGATGTC GACGCAGTGA AAGAAAAAGC TGTCGCCAAT
GCAGGTAAAG TTAGGGTAAT AAGCGAGCAG ACGTCCGGGT TCCCGTCAGG CAAGGGATAG
 
Protein sequence
MMKTIMLLGS GELGREFVIA AKRLGQYVIA VDSYNNAPAQ QVADEREVID MLDGNALDAL 
VARHRPDMIV PEIEAIRTER FYDYEEQGIQ VVPSARAANF TMNRKAIRDL ASKELGLRTA
RYRYAASLEE LRTSVSEVGI PCVVKPLMSS SGKGQSTVKT EEDIERAWSY SQSGRRGDIA
EVIVEAFVPF HTEITLLTVT QKNGPTLFCP PIGHRQERGD YQESWQPCRI ADAQLHEARE
IAENVTHSLT GAGIWGVEFF LADDGLYFSE LSPRPHDTGM VTLAGTQNLT EFELHARAVL
GLPIPEIELL RVGASAVVLA GSEGENPVYT GLEDALRQAG TDIRIFGKPT SRPYRRMAVT
LAYDRPGSDV DAVKEKAVAN AGKVRVISEQ TSGFPSGKG