Gene Sfum_3303 details

Gene Information

Locus tag: Sfum_3303
Symbol:
ID: 4458366
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 4044426
End bp: 4045691
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 61%
IMG OID: 639704075
Product: histidyl-tRNA synthetase
Protein accession: YP_847411
Protein GI: 116750724
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
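
The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 4044426
end_bp = 4045691


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residue count for an unspliced CDS that ends in one stop codon."""
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the final codon is the stop and encodes no residue


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1266 421
```

Both results agree with the Gene Length and Protein Length fields in the record.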


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.375458
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAAGCA TCCAGGCCAT CAGAGGCATG AACGATATTC TGCCGGGGCA GATCGAATGG 
TGGCAGAAAG TGGAGAAAGC CGCCCGCGAA GTCCTGGAGG ATTTCGGCTA CCGTGAGATT
CGAACTCCCG TCCTCGAGAA GCTCGAGCTC TTCGCCAGGG GAATCGGCGA GAGCACCGAC
ATTGTGGAAA AGGAAATGTA CGCCTTCCCC GACCGCAAGG GGGACATGCT GGGTCTGCGT
CCGGAGGCCA CCGCATCGGT GGTGAGAGCA TACATCCAGC ATAATCTGCA GGCGGATCCC
TTCACTCAGA AGTTCTACCT GATGGGACCG ATGTTCCGCC ATGAACGCCC CCAGAAAGGG
CGCTATCGAC AGTTTCACCA GATCGACGCG GAAGCGTTCG GCATCGACGA TCCCATGCTC
GACGCCGAAG TCATGTACAT GCTCCGGCTT TTCTTCGAAC GCGTCGGCCT GAGCGGAGTC
GTTCTGCACA TCAACTCCCT AGGCTGCCAT GAATGCCGGC AGGAATATCG TTCGGTTTTG
AAGGAATATC TCGGAGGCCA CGTCGAGCGC CTGTGCCCGG ATTGCCTGCG GCGGCGCGAG
ACCAATCCCC TGCGGGTGTT CGACTGCAAG GTGGAACGTT GCCAGGCTGT GCTCGAGGAC
GCTCCTTTGC TGCCGGACTA CATCTGCGGC GACTGCGGAG AACACTTCGC CCGAGTAAGG
GACTACCTCC AGCAGCTCCA AACGGATTTC GTCATCGACC CGAGAATGGT GCGGGGATTG
GACTATTACA CGCGAACCAC CTTCGAAGTC ATAACGGACC GCCTGGGAGC TCAAAATGCC
GTGGGGGGCG GCGGACGCTA CAACGGGCTG GTACGGGATC TGGGAGGGCC GGACTTGCCC
GGCATCGGGT TCGCCATCGG GATGGAACGC CTCATCCTGC TGCTCCAGCA GGAAGGGGAG
GAATCGAAGC GAAGCCCGCG GCTGTTCATC GCAACCCTGG GGGAAGCGGC AAGACTGAAG
GGCTTTCTGC TGGCCCAGCA GTTTCGAGCC CTCGGCGTTT CGACCGAAAC GGACTATGAA
GCCAGGAGCC TCAAGAGCCA GATGCGCCGC GCCGACCGGT CGGGAGCGCG TTACGTACTC
ATCCTGGGAG AAGAGGAAAT CGCCCGGGGC GAAATCCAGC TCCGGGACCT GCGGGAGAAG
TCCCAGGTCA ATCTGCCCCT GGCGTCGGCG TCGGAAACCG TCCACCGGAT GTGTCGAGAC
GCCTGA
 
Protein sequence
MESIQAIRGM NDILPGQIEW WQKVEKAARE VLEDFGYREI RTPVLEKLEL FARGIGESTD 
IVEKEMYAFP DRKGDMLGLR PEATASVVRA YIQHNLQADP FTQKFYLMGP MFRHERPQKG
RYRQFHQIDA EAFGIDDPML DAEVMYMLRL FFERVGLSGV VLHINSLGCH ECRQEYRSVL
KEYLGGHVER LCPDCLRRRE TNPLRVFDCK VERCQAVLED APLLPDYICG DCGEHFARVR
DYLQQLQTDF VIDPRMVRGL DYYTRTTFEV ITDRLGAQNA VGGGGRYNGL VRDLGGPDLP
GIGFAIGMER LILLLQQEGE ESKRSPRLFI ATLGEAARLK GFLLAQQFRA LGVSTETDYE
ARSLKSQMRR ADRSGARYVL ILGEEEIARG EIQLRDLREK SQVNLPLASA SETVHRMCRD
A
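
The protein sequence corresponds to translating the gene sequence with translation table 11. The following is a minimal sketch of that translation, not the tool that produced this record. It assumes the sense strand is given 5' to 3' as printed, and it uses the standard codon assignments, which table 11 shares with table 1 (the two differ only in the permitted start codons). The helper names `clean` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "ATGGAAAGCA TCCAGGCCAT CAGAGGCATG AACGATATTC TGCCGGGGCA GATCGAATGG"
print(translate(first_line))  # MESIQAIRGMNDILPGQIEW
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` should reproduce the remaining residues in the same way.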