Gene Cpha266_1891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1891 
SymbolpurT 
ID4570850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2192007 
End bp2193176 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content57% 
IMG OID639766473 
Productphosphoribosylglycinamide formyltransferase 2 
Protein accessionYP_912331 
Protein GI119357687 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0027] Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase) 
TIGRFAM ID[TIGR01142] phosphoribosylglycinamide formyltransferase 2 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.216866 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAAAA AAATCATGCT GCTGGGCAGC GGGGAGCTGG GCAAGGAGTT TGTGATTGCC 
GTTAAACGTC TGGGACACTT TGTGATTGCC GTTGACAGCT ATAATGATGC TCCTGCGCAG
CAGGTGGCTG ACCGGCGTGA GGTGATCAAT ATGCTCGACG GCGCGGCTCT CGATGCCATC
GTTGCCCGGC ATCAGCCCGA TGTGATCGTG CCTGAAATCG AGGCTATTCG TACCGAGCGG
TTTTACGATT ATGAAAAAGA GGGGATACAG GTTGTTCCTT CGGCCCGTGC CGCCAATTTC
ACCATGAACC GGAAAGCTAT TCGCGATCTG GCTTCGAAAG AGCTTGGTCT TCGAACGGCG
ACGTATCGTT ACGCAGCGTC GAAGGAAGAG CTGAAGAGGG CGATAGGGGA GGTGGGAGTT
CCCTGTGTGG TAAAACCGCT GATGAGCTCG TCGGGCAAGG GGCAGTCAAC CGTTAAAACG
GAGGCTGACA TTGAACATGC CTGGAGCTAT TCGCAAAGCG GCAGGCGCGG TGATAGTGTG
GAGGTGATTG TTGAGGCCTT TGTGCCGTTC CATACCGAGA TTACGCTCTT GACCGTCACG
CAAAAAAACG GCCCGACGCT TTTCTGTCCG CCCATCGGGC ACCGTCAGGA GCGGGGTGAT
TATCAGGAGA GCTGGCAGCC CTGCCGAATC GGCGATGCGC AGTTGCATGA AGCTCAGGAG
ATCGCTGAAA AAGTGACTCG TTCGCTGACA GGTGCGGGGA TCTGGGGTGT GGAGTTTTTT
CTGGCCGATG ACGGGCTTTA TTTTTCGGAG CTCTCCCCCC GTCCGCACGA TACCGGCATG
GTGACGCTGG CTGGTACGCA GAACTTTACG GAGTTCGAGC TTCATGCGCG GGCGGTTTTA
GGGCTTCCGA TTCCGAAGAT CGAACTGCTG CGGGTGGGTG CGAGCGCTGT GGTGTCAGCC
GACAGAGAGG GGAAGAACCC TGATTACAGC GGCCTTGAAG AGGCTCTCGG TGAGCCTTGC
ACCGATATTC GTATTTTCGG AAAACCGGCA ACCCGCCCTT ATCGCCGAAT GGGCGTAACG
CTCGCTTACG ACGAACCGGG CAGCGATGTC GATACGGTGA AGGCGAAAGC CATTGCCAAT
GCCCGCAAGG TGAGGGTGAC GAGCGAGTAG
 
Protein sequence
MQKKIMLLGS GELGKEFVIA VKRLGHFVIA VDSYNDAPAQ QVADRREVIN MLDGAALDAI 
VARHQPDVIV PEIEAIRTER FYDYEKEGIQ VVPSARAANF TMNRKAIRDL ASKELGLRTA
TYRYAASKEE LKRAIGEVGV PCVVKPLMSS SGKGQSTVKT EADIEHAWSY SQSGRRGDSV
EVIVEAFVPF HTEITLLTVT QKNGPTLFCP PIGHRQERGD YQESWQPCRI GDAQLHEAQE
IAEKVTRSLT GAGIWGVEFF LADDGLYFSE LSPRPHDTGM VTLAGTQNFT EFELHARAVL
GLPIPKIELL RVGASAVVSA DREGKNPDYS GLEEALGEPC TDIRIFGKPA TRPYRRMGVT
LAYDEPGSDV DTVKAKAIAN ARKVRVTSE