Gene Cphy_2787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_2787 
Symbol 
ID5742102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp3392857 
End bp3394113 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content37% 
IMG OID641293878 
Producthistidine--tRNA ligase 
Protein accessionYP_001559886 
Protein GI160880918 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3705] ATP phosphoribosyltransferase involved in histidine biosynthesis 
TIGRFAM ID[TIGR00443] ATP phosphoribosyltransferase, regulatory subunit 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAGAT TATTACACAC CCCAGAGGGT GTTCGTGATA TTTATAACTC TGAGTTTGCA 
AAGAAAAAGA TGCTAGAACA AGAGCTTAGT AAGCGCTTGG CACTGCATGG ATTTCATGAG
ATACAAACCC CGATGTTTGA GTTTTTTGAT ATTTTTAGCA AAGAACGTGG AAGTGTAAGT
GGTAAAGAGA TGTATAAGTT TTTTGATAGG GAAGGTAATA CCTTAGTACT TCGCCCAGAT
ATCACACCAT CCATAGCACG CTGCGTTGCT AAATACTATA AGACAGAAGA GATGCCAATA
CGTTTAAGTT ATTGTGGAAG CACTTTTATT AATAGTAGCA GTTACCAGGG GAAATTAAAG
GAAACTACTC AGTTAGGAGC AGAATTAATT AATGATGCAA GTATAGAAGC GGATGCTGAG
ATGATTGCAT TGACGGTAGA ATGCTTAAAA TGTGCCGGTT TAAAAGAGTT TCAAGTAGAA
ATAGGTCAAG CTGACTTCTT TCTCGGAATT GTAGAAGAAG CAGGATTTGA TGAAGATGAA
ACCGAACAGT TACGTATTCT GATTGAAAAT AAGAATTTAT TTGGTGTGGA AGAACTAATT
AGTGGAAAGA AGTTAGAAAA ACCTGTAAAG AGTGTTATTT TACAGTTAAC TGACCTTTTT
GGAACACTAG ATAAAGTACT TGGTGTGAAG GAATCCATCC ATAATGAACG TGCAAGAAAT
GCTTTAGAGC GTATGGAAAA ACTATATGAA TTATTAACAC TCTATGGATA TGAACAGTAT
ATCACCTTTG ATTTAGGAAT GCTTAGTAAG TACAATTACT ATACCGGAAT TATTTTCAGA
GCTTACACCT ATGGAACCGG TGATGCGGTG ATTACTGGAG GTCGTTATGA TTCTTTGGTT
TCGCAGTTTG GAAAGCAGGC ACCAGCGATT GGTATGGCTG TTTTAATAGA CCAACTTCTG
ACTGCACTAA GTAGGCAAAA ACTATTAGGA GAGCCTGAAT TAGAGAATAC CTTGATTGTT
TACGATTCTT CTTATATCGC AAATGCTGTG GCTCTTGCCA ATCATTTTCG TGGACAGGAG
ATGAAGATTG AAATGTTGGC CCATGACGAA AGAAAAACGA GAGAGGATTA TATAGCGTAT
GCAAATCGTA TGAGTATTGG TGGTATTCTT GCATTATTTA CCGAAGACGA GGTAGAGGTG
ATTCATGCAA TCGATGGAAC AGTACAAACG GTACCACTCA AAGGAATGTT ATCCTAG
 
Protein sequence
MDRLLHTPEG VRDIYNSEFA KKKMLEQELS KRLALHGFHE IQTPMFEFFD IFSKERGSVS 
GKEMYKFFDR EGNTLVLRPD ITPSIARCVA KYYKTEEMPI RLSYCGSTFI NSSSYQGKLK
ETTQLGAELI NDASIEADAE MIALTVECLK CAGLKEFQVE IGQADFFLGI VEEAGFDEDE
TEQLRILIEN KNLFGVEELI SGKKLEKPVK SVILQLTDLF GTLDKVLGVK ESIHNERARN
ALERMEKLYE LLTLYGYEQY ITFDLGMLSK YNYYTGIIFR AYTYGTGDAV ITGGRYDSLV
SQFGKQAPAI GMAVLIDQLL TALSRQKLLG EPELENTLIV YDSSYIANAV ALANHFRGQE
MKIEMLAHDE RKTREDYIAY ANRMSIGGIL ALFTEDEVEV IHAIDGTVQT VPLKGMLS