Gene Haur_0117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0117 
Symbol 
ID5732010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp151209 
End bp152342 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content48% 
IMG OID641277239 
Productaminotransferase class I and II 
Protein accessionYP_001542897 
Protein GI159896650 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00105897 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTTTTTG ATGATACTGA TTTAACGCCC AACCGGATTG AAATGGCTCG TCGCGCTGCG 
ACGGGTCATT TTATTGATCT GACCAGTAGT AACCCAACGC AGCAAGATTT GCTTTTTCCG
CCCGATGTGC TGGAGCAAGC CGCCCAAAAA TATTGGTCAC AACGGTGCTA CGAGCCAAAT
CCACGCGGTT TGGAGCCAAC TCGGCAGGCA ATTATTGATT ATTATGCTCA GCGCAGGCCA
GCTTTGGCGC TAACGCTTGA TGATATTTTT ATTACTGCCA GCACCAGCGA GGCCTATAGT
TTGTTGTTCT CGTTGCTCAC CGCACCAGGC GATAATATTC TTGGGCCAAA TGTCACCTAT
CCATTGTTTG AATATTTGGC CGATTTGCAT CACGTTGAGT TGCGTACCTA CGAGCTTGAT
CCTGCTAACA ATTGGGTGAT CGATCAGGCT TCGTTGCTGG CCGCTGCTGA TCAGAACACA
CGGGCGATTT TATTAATTTC ACCGCATAAT CCAACTGGGG CAATTATTAG CGAGCCAATC
GCCGCATTAA ATCAACTAGG AATTCCATTG ATTTGCGATG AAGTGTTTGC GCCGTTTGCG
TTGGCCAAAT CGCATGTGCC AGCATTAGGC GGGCTGCATC CCGACGTACC CGTATTTCAA
TTGAATGGCA TTTCTAAGCT CTTAGCCTTG CCCGACCTCA AACTAGGCTG GATTGCGCTG
AATCAAGCGG CTCAAGGCTA TGCAGAGCGT TTAGAACTGA TCAACGATAC CTTTTTGAGT
TGTAGCACAT TAATTCAGAC GATGCTGCCC GATTTGTTGC ATGCTGCGCC GCCGTTTATC
GATCAGATGC TTGAGCGAGT GCGAGCGAAT ATTGCCTATG CCCGTGAGCA TTTAGCCCAG
CACCCACGCT TGATTTGGAG CGAGCCAGAT GGTGGCTATT ATTTATTTTT GCAAGTACGC
GATGAATTCG ATGATGAAGC CTTGGTCGTG CGTTTGATCG AACAAGGGGT GTTGGTGCAT
CCAGGCTTCT TTTTCGATTG GATCGATGAT TGTCGGATTA TGCTCTCGGC CTTGACCGAA
CCGCACCAAT GGCAAGCAGG TATCCAAAAA TTGGCTCAGA TTCTTACAAT TTGA
 
Protein sequence
MVFDDTDLTP NRIEMARRAA TGHFIDLTSS NPTQQDLLFP PDVLEQAAQK YWSQRCYEPN 
PRGLEPTRQA IIDYYAQRRP ALALTLDDIF ITASTSEAYS LLFSLLTAPG DNILGPNVTY
PLFEYLADLH HVELRTYELD PANNWVIDQA SLLAAADQNT RAILLISPHN PTGAIISEPI
AALNQLGIPL ICDEVFAPFA LAKSHVPALG GLHPDVPVFQ LNGISKLLAL PDLKLGWIAL
NQAAQGYAER LELINDTFLS CSTLIQTMLP DLLHAAPPFI DQMLERVRAN IAYAREHLAQ
HPRLIWSEPD GGYYLFLQVR DEFDDEALVV RLIEQGVLVH PGFFFDWIDD CRIMLSALTE
PHQWQAGIQK LAQILTI