Gene Paes_2331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_2331 
SymbolpurT 
ID6459404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp2505511 
End bp2506692 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content57% 
IMG OID642726297 
Productphosphoribosylglycinamide formyltransferase 2 
Protein accessionYP_002016969 
Protein GI194335109 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0027] Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase) 
TIGRFAM ID[TIGR01142] phosphoribosylglycinamide formyltransferase 2 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.659291 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATGC CGAAAAAAAT AATGCTGCTT GGCAGCGGGG AATTAGGAAA GGAGTTCGTT 
ATTGCCGCCA AACGGCTCGG GCAATTTGTG ATTGCGGTTG ACAGTTACCA TGACGCTCCG
GCGCAACAGG TGGCCGATGA ACGTGAAGTG ATCGATATGC TTGACGCCGA GGCACTCGAC
GCCATTGTGG CGAAACATAG TCCGGAGATC ATCGTGCCTG AAATCGAAGC TATCCGAACC
GAGCGTTTCT ACGACTACGA ACAGCAGGGG ATACAGGTCG TGCCCTCGGC CCGGGCTGCG
AATTTCACGA TGAACCGGAG AGCGATCCGT GACCTGGCAG CCAAAGAGCT GGGCTTGAGA
ACGGCGGACT ACCGTTATGC GGCATCGTTC GAGGAACTGC AGCTCGCCAT TGAAGCGATC
GGATTGCCCT GTGTCGTCAA ACCACTGATG AGCTCGTCGG GCAAGGGGCA GTCGGTCGTC
AGAAACAGTG CCGATATCGG TCAGGCATGG GACTATTCGC AGAGCGGCAA GCGTGGCGAC
AGTACAGAGG TAATCGTCGA AGCATTCGTC TCGTTCCATA CCGAGATCAC CCTCCTGACG
GTAACGCAGC ACAACGGCCC GACGCTGTTC TGTCCTCCGA TCGGGCATCG TCAGGAACGG
GGCGATTATC AGGAGAGCTG GCAGCCGTGC CTCATCGATG AAAAATATCT GCGACAAGCA
GAGGAAATGG CTGACAAGGT GACCAGTTCG CTCGGCGGAG CGGGGATCTG GGGTGTCGAG
TTTTTTCTTG CCGATGACGG CCTCTATTTC TCGGAACTTT CGCCCCGACC ACACGATACC
GGCATGGTTA CGCTTGCAGG CACCCAGAAC CTGACGGAAT TCGAACTGCA TGCACGCACG
ATCCTGGGAC TGCCGATTCC TGAAATCCAG CTCCTGCGCG CCGGAGCCAG CGCAGTGATT
CTGGCTGACA GAGAAGGCGA CAATCCCCGA TTCACAGGCC TGAAAGAGGC GCTGACCGAT
CCCGACACAG ACATTCGGAT CTTCGGAAAA CCGACAACCC GCCCATGCCG CCGCATGGGT
GTAGCGCTGG TTTCAGGCAA GCCCGATGCC GATCTGGCGA GCCTCAAGCA ACAAGCCATC
AGCAATGCCG CCAGAGTTAC CGTCGTCTGC GATGAGCGTT GA
 
Protein sequence
MTMPKKIMLL GSGELGKEFV IAAKRLGQFV IAVDSYHDAP AQQVADEREV IDMLDAEALD 
AIVAKHSPEI IVPEIEAIRT ERFYDYEQQG IQVVPSARAA NFTMNRRAIR DLAAKELGLR
TADYRYAASF EELQLAIEAI GLPCVVKPLM SSSGKGQSVV RNSADIGQAW DYSQSGKRGD
STEVIVEAFV SFHTEITLLT VTQHNGPTLF CPPIGHRQER GDYQESWQPC LIDEKYLRQA
EEMADKVTSS LGGAGIWGVE FFLADDGLYF SELSPRPHDT GMVTLAGTQN LTEFELHART
ILGLPIPEIQ LLRAGASAVI LADREGDNPR FTGLKEALTD PDTDIRIFGK PTTRPCRRMG
VALVSGKPDA DLASLKQQAI SNAARVTVVC DER