Gene BURPS668_A3097 details


Gene Information

Locus tag: BURPS668_A3097
Symbol: tyrB
ID: 4888353
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2935446
End bp: 2936663
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 70%
IMG OID: 640133033
Product: aromatic amino acid aminotransferase
Protein accession: YP_001064088
Protein GI: 126444155
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1448] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:
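
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 2935446
END_BP = 2936663


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1218 405
```

Under these assumptions the computed values agree with the Gene Length (1218 bp) and Protein Length (405 aa) fields.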


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCGAAC ATCTTCCCGC CCATCCCGGC GACCCGATCC TGTCGCTGTT CCAGGCGTTT 
CAGCGCGATC CCGAGCCGCG CAAGGTCAAT CTGAGCATCG GCCTTTACTA CGACGAAAAC
GGCGCCGTGC CGGTGCTCGA CAGCGTGCGC GCGGCGGCGG CGCGGCTGGC CGCGCGGGAC
GACGCCCACA CGTATCTGCC GATGGAAGGC ATGGCCGACT ACCGGCGCGC GCTGCAGGCG
CTCGTGTTCG GCGCGAACAG CGCCGCGCTG CGCGAACAGC GGATCGCGAC CGTGCAGACG
GTAGGCGGCT CCGGCGCGCT GCGCCTCGGC GCGGATCTGC TCAAGCGCTA TTTCCCCGAC
AGCGCGATCT GGATCGGCGA TCCGACGTGG GACAACCACC GCGTGCTGTT CGCCGCGGCG
GGACTCGACG TGCATACGTA TCCGTATTAC GACGCGGCGA CGAACGGCGT GCGCTTCGAC
GCGATGATGG CGACGCTCGA CACGCTGCCC GCGCGCGCGA TCGTGCTGCT GCAGCCGTGC
TGCCACAACC CGACGGGCAT CGATCTGTCG CGCGGGCAGT GGCGCGAGAT CGCCGCGCTG
TGCGAGCGGC GCGCGCTGAT TGCGTTTCTC GACATCGCGT ATCAGGGCTT CGGCGACGGC
CTCGACGACG ACGCGTGGCC GATCCGCGCG ATGGCCGATG CGGGGCTGCC CGTGTTTGTC
AGCCATTCGT TCTCGAAGAA CTTCTCTCTG TACGGCGAAC GCTGCGGCGG GCTGTCGATC
GCATGCGCGA ACGAACGCGA AGCCGCACGG GTGCTGAGCC AGATCCAGGC GGGCGTGCGC
CGCGTCTATT CGAGCCCGCC GCTGCACGGT GCGCGCCTCG TCTCGACCGT GCTGAACGAT
CCGGCGCTCG CGCGGCAATG GGACCGCGAC GTCGCCGCGA TGCGCGCGCG AATCAAGCGG
ATGCGCACCG CGCTCGCCGC GCGGCTCGCG GCGCGCGTGC CCGGCGCGTC GTTCGACTAT
CTCGTCGAGC AGCGCGGGAT GTTCAGCTAC ACGGGGCTCG CGCCCCATGA GGTCGACGCG
CTGCGCGAGC ACGACGGCGT CTATCTGCTG CGCTCCGGCC GCGCATGCAT CGCGGGGCTG
AGCGATGCGA ACGTCGACCA TGTCGCGAAC GCGATCGCCG CGGTGTTGAA GGCGCGGCGA
GCGCGCGCCG CGGCGTGA
 
Protein sequence
MFEHLPAHPG DPILSLFQAF QRDPEPRKVN LSIGLYYDEN GAVPVLDSVR AAAARLAARD 
DAHTYLPMEG MADYRRALQA LVFGANSAAL REQRIATVQT VGGSGALRLG ADLLKRYFPD
SAIWIGDPTW DNHRVLFAAA GLDVHTYPYY DAATNGVRFD AMMATLDTLP ARAIVLLQPC
CHNPTGIDLS RGQWREIAAL CERRALIAFL DIAYQGFGDG LDDDAWPIRA MADAGLPVFV
SHSFSKNFSL YGERCGGLSI ACANEREAAR VLSQIQAGVR RVYSSPPLHG ARLVSTVLND
PALARQWDRD VAAMRARIKR MRTALAARLA ARVPGASFDY LVEQRGMFSY TGLAPHEVDA
LREHDGVYLL RSGRACIAGL SDANVDHVAN AIAAVLKARR ARAAA
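
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate a nucleotide string. It is an illustration, not the database's own method. It assumes the sequence is read in frame from its first base and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which start codons are permitted). All function names are hypothetical. Only the first line of the gene sequence is used as input, so the whole-gene GC content is not recomputed here.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same codon-to-amino-acid mapping ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied from the record above.
first_line = "ATGTTCGAAC ATCTTCCCGC CCATCCCGGC GACCCGATCC TGTCGCTGTT CCAGGCGTTT"
print(translate(first_line))  # prints: MFEHLPAHPGDPILSLFQAF
print(gc_fraction(first_line))
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same functions would allow the 70% GC content and the 405-residue protein to be checked in the same way.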