Gene HS_0549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0549 
SymbolpurT 
ID4240032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp582060 
End bp583241 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content41% 
IMG OID638104098 
Productphosphoribosylglycinamide formyltransferase 2 
Protein accessionYP_718760 
Protein GI113460694 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0027] Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase) 
TIGRFAM ID[TIGR01142] phosphoribosylglycinamide formyltransferase 2 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAATAT TAGGAACAGC TTTAACACCT AAAGCAACAA AAGTCATGCT TCTTGGTTCA 
GGTGAACTAG GCAAAGAAGT CGTTATAGAA TTACAACGTC TTGGTGTAGA AGTGATTGCG
GTAGATCGCT ATGAAAATGC ACCGGCACAA CAAGTTGCAC ACCGCTCTTA CACTATTTCA
ATGTTAGATG GAAATGCTTT GAAAGCATTA ATTGAAAAAG AAAAACCGGA TTATATCGTG
CCGGAAGTCG AAGCAATCGC AACCTCAACA CTAGTTGAAT TGGAGCAAGC CGGCTTTAAT
GTTGTCCCAA CTGCTAAAGC TACACAATTA ACGATGAACC GTGAAGGTAT TCGTCGCCTT
GCAGCAGAAA AATTAAAGTT GCCAACATCA AATTATCAAT TTGTAGATAA TTTTGATGAC
TTTCAAAGTG CGGTCGAAAA CCTTGGCATT CCTTGTGTGA TCAAACCGAT TATGTCTTCT
TCTGGACATG GTCAGAGTAT CTTAAAAAGC AAAGACGACC TACAAAAAGC ATGGGACTAT
GCACAACAAG GGGGAAGAGC GGGAGCCGGT CGAGTGATCG TTGAAGGTTT CGTTAAATTT
GATTATGAGA TTACTTTATT AACAGTACGC CATGCAGAAG GAACTTCATT TCTAGCACCT
ATCGGACATC GTCAAGAAAA AGGGGACTAT CGTGAGTCTT GGCAACCGCA AGCAATGTCT
GTACAAGCAC TCGCCAAGGC TCAACATATT GCTGAGAAAA TTACTACGGA ACTCGGCGGA
CGCGGCATTT TTGGTGTAGA AATGTTTGTC TGTGGAGATG AAGTGATTTT CAATGAAGTA
TCTCCTCGCC CTCATGATAC AGGAATGGTT ACGTTGATCT CGCAAGAGCT TTCTGAATTT
GCACTTCATG CAAGAGCCAT ATTAGGATTA CCAATTCCTG AAATTAATTT AATTAGCCCA
GCCGCCTCAA AAGCGATTGT CGTTGAAGGC AAATCTAACC AAGTACAATT TGGTAATTTA
TTCGAAGTAT TACAAGAACC TAATACTAAT ATTCGCTTAT TCGGCAAAGG CGAAGTCAAT
GGGCATCGTC GCTTAGGTGT TATTCTTGCA CGTGATATTT CTGTTGATAA GGCGTTAGAA
AAAGTTTCTC GAGCCTATGA TAAATTAGAC ATACAATTGT AG
 
Protein sequence
MTILGTALTP KATKVMLLGS GELGKEVVIE LQRLGVEVIA VDRYENAPAQ QVAHRSYTIS 
MLDGNALKAL IEKEKPDYIV PEVEAIATST LVELEQAGFN VVPTAKATQL TMNREGIRRL
AAEKLKLPTS NYQFVDNFDD FQSAVENLGI PCVIKPIMSS SGHGQSILKS KDDLQKAWDY
AQQGGRAGAG RVIVEGFVKF DYEITLLTVR HAEGTSFLAP IGHRQEKGDY RESWQPQAMS
VQALAKAQHI AEKITTELGG RGIFGVEMFV CGDEVIFNEV SPRPHDTGMV TLISQELSEF
ALHARAILGL PIPEINLISP AASKAIVVEG KSNQVQFGNL FEVLQEPNTN IRLFGKGEVN
GHRRLGVILA RDISVDKALE KVSRAYDKLD IQL