Gene Hhal_1973 details


Gene Information

Locus tag: Hhal_1973
Symbol:
ID: 4710452
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2174385
End bp: 2175674
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 67%
IMG OID: 639856446
Product: aminotransferase, class I and II
Protein accession: YP_001003539
Protein GI: 121998752
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0436] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.238227
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAACTT CAAGAGAGGG TGCGAGCCGG CCGCTCCGCC CGGGGCGGCC GACGCGCGAT 
CGCAGTGCAG AGCAACGCGG AGAGCTTGCC GTGGCCTATC CCGACGATAT CGATCCCGAC
GACGAGCGCG AGCCGGTGTG GAGCCCCTCG ATCCGGGCCC TGCCCATTCC CGGCATCCGC
AAGATGGTGA ACATGGCGGC CGAGATGGAC GACGTCATCC ATCTGTCCAT TGGCCAGCCG
GACTTCCCCA TGCCGGAGCA CGTTGTCGAG GCCCACATCC AGGCCCTGCG CGACGGCAAG
ACCGGCTACA CCATGGATGC CGGCCTGCCG CAGATGCTCG AGGCGGTGGC GGAGTACTAC
AGCCACCGCT ACGACCGCCC GCTGGAGCCG GAGAACGTGC TCATCACCAC CGGCGCCACC
GAGGCGATGT ATCTGGCCAT CGCGGCCACC GCGGCGCCTG GGCGGCAGTT CCTGATCCCG
GATCCGACCT TCCCGCTCTA CGCCCCGCTG ATCCGCATGA ACGGCGCCGA GGTCAAGCCG
ATCCCCACCC GCGCAGAGCA CGGTCACCAG ATCGATCCCC AGGAGGTGAT CGACAACATC
GGCATGCGCA CCTTCGGGAT CATCCTCAAC TCGCCGAGCA ACCCCACCGG TACGGTCTAC
CCCCGGGAGA CCATCGAGGC CATCGTCCAG GAGGCCGCCT ACCGTGGGGT CTACGTCTTC
AGCGACGAGG TCTACGACCA CCTGCTGCTC GACGAGATGG AGTATCCGAG TGTGCTGCGC
TGCACCTCGG ACCTGGACCA CGTCATGGCG GTCTCCAGCC TGTCGAAGAC CTTCAGTATG
GCCGGTCTGC GCATCGGCTG GTTGATCTCC AGCCAGGGGG CGATCAAGAA GCTCCAGCGC
TTCCATATCT TCACCACCAC GGTCGCCAAC ACGCCGGCGC AGTGGGCCGG GGTGGCCGCC
CTCAAGGGGG GGATGGCGTG CGTCGACGAG ATGCTCGAGG CCTACCGTCA GCGGCGTGAC
CGCATCGTTG AGCTCGTTAG CAAGACCCCG CACCTGACCA GCTACCGGCC GCAGGGGGCG
TTCTACATCT TCCCGTCGCT GCCGCCGAAC ACCGACGCCA CCAACCTGGC CACGCGCATG
CTCAAGGAGA CCGGCGTGTG TGTCGTCCCG GGCGACGCCT TCGGCGACAG CTGCCCGAAC
TCGTTGCGCA TCAGCTACGC GGCCTCGATG GACGACATCG AGCGGGCCTT CGAGCGCATC
ATCCCGTGGA TGGAGAAGCA GGGCTTCTAG
 
Protein sequence
MTTSREGASR PLRPGRPTRD RSAEQRGELA VAYPDDIDPD DEREPVWSPS IRALPIPGIR 
KMVNMAAEMD DVIHLSIGQP DFPMPEHVVE AHIQALRDGK TGYTMDAGLP QMLEAVAEYY
SHRYDRPLEP ENVLITTGAT EAMYLAIAAT AAPGRQFLIP DPTFPLYAPL IRMNGAEVKP
IPTRAEHGHQ IDPQEVIDNI GMRTFGIILN SPSNPTGTVY PRETIEAIVQ EAAYRGVYVF
SDEVYDHLLL DEMEYPSVLR CTSDLDHVMA VSSLSKTFSM AGLRIGWLIS SQGAIKKLQR
FHIFTTTVAN TPAQWAGVAA LKGGMACVDE MLEAYRQRRD RIVELVSKTP HLTSYRPQGA
FYIFPSLPPN TDATNLATRM LKETGVCVVP GDAFGDSCPN SLRISYAASM DDIERAFERI
IPWMEKQGF
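
The length, GC content, and protein fields in this record can be cross-checked against the coordinates and the gene sequence. The following is a minimal Python sketch of such a check, not the database's own method. The function names (`gene_length`, `translate`, `gc_percent`) are hypothetical and chosen for illustration. It assumes inclusive start and end coordinates, and it uses the standard codon assignments, which translation table 11 shares (table 11 differs only in its permitted start codons).

```python
# Illustrative helpers; all names here are hypothetical, not part of the source database.
BASES = "TCAG"
# Amino acids in NCBI order (T, C, A, G at each codon position).
# Translation table 11 uses the same codon-to-amino-acid assignments as the standard table.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def gene_length(start_bp, end_bp):
    """Length in bp, assuming both coordinates are inclusive."""
    return end_bp - start_bp + 1


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)
```

With the values from this record, `gene_length(2174385, 2175674)` gives 1290, matching the Gene Length field, and 1290 bp corresponds to 430 codons, that is, 429 amino acids plus the stop codon. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the Protein sequence and GC content fields.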