Gene Haur_0053 details


Gene Information

Locus tag: Haur_0053
Symbol:
ID: 5731925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 68112
End bp: 69212
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 52%
IMG OID: 641277174
Product: aminotransferase class I and II
Protein accession: YP_001542833
Protein GI: 159896586
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0436] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:
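
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(68112, 69212)
    print(length, protein_length_aa(length))
```

Under these assumptions, the listed coordinates give 1101 bp, and 1101 bp corresponds to 366 residues, in agreement with the record.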


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCAGC CAATCTATCT TGAATGGGCC AAACAACAGC CTGCCACTGC GATTTTGAGC 
AATAGCAGCG TTTACTTGCC CAGTGTGCGA AACGCAATTC AGCAACAGTT AGCTGATGCT
GCGTGGTTTG CAGCCGCCCA ACAAGCCAAC CCTTGGGGAT ATCCCAGCCT TCAGCAAGCC
TTGCAAGCAT GGCATCGTAG TAGCTATTCG CCGTTGATTG TGGCTGGCGC TTCGAGTGCC
TTGGCGGTGG TATGCCAAGC CTTGTTGCAA CCAGGCGATC ATGCTCTGAT CGAAACCCCG
CAATATGAGC CATTCAGCCG TTGTGTCGCC GCCCGAGGAG CCACTTGGAG CGCTTGGCCA
CGCCATCCGC AGAGCTTTGA ATTGCAATTA GACCAATTGG CCAGCGGCAT CACGCCTCGG
ACCAAGCTGT TGTTAATCAG CAACTTGCAT AACCCCAGCA GCACACTGAG CAACCAATCC
CAATTGCTTC GCTTGCAACA AACCCTTACT CAAGCCACAG ATACGCTTGG AATTGCTCCG
ATCACGATGG TGGTTGATGA GATTTATTGG CATTTGGTAG CACAAGCTGA GTTTCGTTCG
GTAGCCGAAC TTGGGCCGCA GTGGATCGGG ATCAACAGCC TGTCGAAGGT GTATGGGCTA
AGCATGCTGC GTTGTGGCTG GATTATGGCC GCGCCCCAGC TGCTCGATCA ACTACGCCCA
GCTTATCTCG ATTTGATTAA CATTGGTTCG CCGTTGACTG AATATTTGGC GGCGAGCATT
ATTGAACAAT TGGCCAATTA TCAAACTGCT GCCCAAGCCC ATGTTGCCGT CAATCGCCAA
ATAGTGCAAC GCTATATGCA GCCGTTGCTT GAGCGTGAAT TGATCAACGG TGTAATTCCT
GCGGCGGGCT GCACTTATTT TCCCAAAATC ATGCTTGATC AAACCCAAAT TGACCACGTA
GCCCAACAAA CTGGCTGTGT GCCTGGGCGG TTTTTCGGCT CGGCCTATCA GCAGCAGCTA
CGAATTGGCT TTGGCGGCCC GAGTAATTTA ATTGAAGCAG CCTTGAAACC CTTTACCCAA
ACAATCCTAC CACTTCAATA A
 
Protein sequence
MNQPIYLEWA KQQPATAILS NSSVYLPSVR NAIQQQLADA AWFAAAQQAN PWGYPSLQQA 
LQAWHRSSYS PLIVAGASSA LAVVCQALLQ PGDHALIETP QYEPFSRCVA ARGATWSAWP
RHPQSFELQL DQLASGITPR TKLLLISNLH NPSSTLSNQS QLLRLQQTLT QATDTLGIAP
ITMVVDEIYW HLVAQAEFRS VAELGPQWIG INSLSKVYGL SMLRCGWIMA APQLLDQLRP
AYLDLINIGS PLTEYLAASI IEQLANYQTA AQAHVAVNRQ IVQRYMQPLL ERELINGVIP
AAGCTYFPKI MLDQTQIDHV AQQTGCVPGR FFGSAYQQQL RIGFGGPSNL IEAALKPFTQ
TILPLQ
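
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration, not the pipeline that produced the record. It assumes the sequences are pasted as shown, in whitespace-separated blocks of ten. It uses the standard codon assignments, which translation table 11 shares for internal codons. Table 11 also permits alternative start codons, which this sketch does not handle, since this gene begins with ATG. All function names are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    print(translate("ATGAACCAGC CAATCTATCT TGAATGGGCC"))
```

Applied to the first three blocks of the gene sequence, the sketch reproduces the first ten residues of the protein sequence; the same functions can be run on the full sequence to compare against the listed protein and GC content.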