Gene HS_0619 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0619 
Symbol 
ID4240105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp665297 
End bp666511 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content36% 
IMG OID638104171 
Productaminotransferase AlaT 
Protein accessionYP_718831 
Protein GI113460764 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAGTTT TCCCTAAATC CGATAAGTTA GAACATGTAT GCTACGATAT TCGTGGACCG 
GTACATCAAG AGGCGTTACG TTTGGAAGAA GAAGGCAAAA AGATCTTAAA ATTAAATATT
GGTAATCCAG CACCATTCGG TTTTGAAGCT CCTGATGAGA TTTTGGTAGA TATTTTGCGT
AATTTGCCTG CCGCACAAGG ATATTGTGAT TCAAAAGGTC TATATTCTGC ACGTAAAGCT
ATTGTTCAAT ATTACCAATC TAAGAATATT TTAAGTGCGA CCGTAAATGA TGTATACATC
GGTAATGGTG TTTCTGAACT TATTACTATG TCATTACAAG CATTGTTGAA TGATGGTGAT
GAGGTGTTAA TCCCAATGCC TGATTACCCA TTATGGACGG CTGCTGCAAC GTTAGCCGGC
GGAAAAGCGG TTCATTATTT ATGTGATGAA CAAGCTGATT GGTTTCCTGA TGTGGCAGAT
ATAAAAAGTA AGGTTACTTC TCGCACAAAA GCCATTGTTA TTATTAATCC TAATAATCCA
ACGGGAGCTG TTTATAGTAA AGAATTATTA TTAGATATAG TTGAAGTAGC TCGCCAAAAT
GGATTGATGA TTTTTGCCGA TGAAATTTAC GATAAAATTT TATATGACAA CGCTGTTCAT
CATCATATCG CCGCATTAGC ACCGGATTTG TTAACGGTTA CTTTCAACGG CTTATCAAAA
TCTTATCGTG TAGCAGGATT TCGCCAAGGT TGGATGATTT TAAATGGACC AAAACATCAA
GCAAAAGGCT ATATTGAAGG CTTGGATATG TTGGCATCAA TGCGTTTATG TGCAAATCAT
CCTATGCAAC ATGCGATCCA AACTGCTTTA GGTGGATATC AGAGTATCAA TGAATTTATT
TTACCGGGTG GTCGTTTATT AGAACAACGT AATAAGGCTT ATGAGCTGAT TAATCAAATT
CCGGGATTAA GTTGCACAAA ACCGATGGGC GCACTTTATA TGTTCCCGAA AATTGATACT
AAAAAATTCA ATATCTATGA TGATGAAAAA ATGGTATTGG ATTTATTGCG TCAAGAGAAA
GTGTTATTAG TTCATGGACG TGGTTTTAAC TGGCAGCAAC CAGACCATTT TAGAATAGTA
ACCTTACCTT ATGTTAATCA AATTGAAGAC GCTTTAGGTC GATTAGCAAG ATTTTTAGAG
TATTATCGTC AGTAA
 
Protein sequence
MRVFPKSDKL EHVCYDIRGP VHQEALRLEE EGKKILKLNI GNPAPFGFEA PDEILVDILR 
NLPAAQGYCD SKGLYSARKA IVQYYQSKNI LSATVNDVYI GNGVSELITM SLQALLNDGD
EVLIPMPDYP LWTAAATLAG GKAVHYLCDE QADWFPDVAD IKSKVTSRTK AIVIINPNNP
TGAVYSKELL LDIVEVARQN GLMIFADEIY DKILYDNAVH HHIAALAPDL LTVTFNGLSK
SYRVAGFRQG WMILNGPKHQ AKGYIEGLDM LASMRLCANH PMQHAIQTAL GGYQSINEFI
LPGGRLLEQR NKAYELINQI PGLSCTKPMG ALYMFPKIDT KKFNIYDDEK MVLDLLRQEK
VLLVHGRGFN WQQPDHFRIV TLPYVNQIED ALGRLARFLE YYRQ