Gene Arth_1318 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1318 
Symbol 
ID4446172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1479414 
End bp1480598 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content68% 
IMG OID639689126 
Producthomoserine O-acetyltransferase 
Protein accessionYP_830812 
Protein GI116669879 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATTG CCGTCACCCG CAGCGGTGTA CCCGAAACAT CCAGCCACAG CCTGTCAGCC 
CGTGACGTGA AAACCACCGC GGGCAAAACC GCAGGCACTG TCCCCGACGG TACCGTCAGG
TTCCAGGGCA TCGGCGGGCT TGACCTTGAA GCCGGCGGGC ATCTGCCGGA CGTCACACTC
GCCTACGAGA CGTGGGGCAC GCTGAACGCG GACCGTTCCA ACGCCGTGCT GGTGCAGCAT
GCCCTGACCG GCAGCACGCA CGTTACCAGG GGAGCCAGTG ACGAAGAAGG CTGGTGGGAG
CAGCTTGCCG GGCCCGGCGC CCCGGTTGAT ACGGACAAGT ACTTCGTGGT TTCCATCAAC
ATCCTGGGCG GTTGCTACGG CTCCACCGGG CCTTCCACTC CCGCGCCGGA CGGCAGGCCG
TGGGGCTCGC GCTTCCCCCT GGTGACCCTG CGCGACACCA CTGCGGCCGA GGCCCGGTTG
GCGGACGCCC TTGGCATCGA CAGCTGGTAC GCCGTCCTGG GCGGATCCCT GGGTGGAGCC
CGCGCCTTGG AATGGGCCGT TAGCTTCCCT GACCGGGTCC GGCGCTGTGC CGTCATTTCC
ATCGGGGCCA GCAGCACTGC CGAGCAGATC GCCTTTGCCC AGGCGCAGAC CCTCGCCATC
CGCCAGGACG TCAACTTCAA CGGCGGTGAC TACTACGGCG GCCCGGAGCC TGAGGCCGGC
CTGGCCCTGG CGCGCAGGAT CGCGCACATC ACGTACCGCT CCGCAGACGA GCTGGAGGCC
CGGTTCGGCC GGAGCGCCCA GGGCGGCGAA GCCCCGCTTC AGGCAGTCTC GCTGGGAGAC
CGCGGCCGCT ACCAGGTGGA GAGCTACCTC GACCATCAGG GCACCAAGCT GGTCCGCCGC
TTCGATGCCA ACAGCTACAT CGCCATCACG GAAGCGCTCA TGAGCCACGA CGTCGGCCGG
GGACGCGGCC CGCTCAAGGA CGCGCTGGCC CAGGCCAAGG CTGAGTTCTT CATCGCCGCC
GTTAACACCG ACCGGCTGTA TTTTCCTGCA CAGTCCCGCG AACTGGCGGC GGCACTGCCG
GGCGACGTCC CGGTGCACAT CATCGAGGCG CCCATCGGCC ACGACGGTTT CCTGACTGAA
ATCGGGCAGC TTAGCGCGCA GCTGAGGCAG AACTTTTTCG CCTAG
 
Protein sequence
MTIAVTRSGV PETSSHSLSA RDVKTTAGKT AGTVPDGTVR FQGIGGLDLE AGGHLPDVTL 
AYETWGTLNA DRSNAVLVQH ALTGSTHVTR GASDEEGWWE QLAGPGAPVD TDKYFVVSIN
ILGGCYGSTG PSTPAPDGRP WGSRFPLVTL RDTTAAEARL ADALGIDSWY AVLGGSLGGA
RALEWAVSFP DRVRRCAVIS IGASSTAEQI AFAQAQTLAI RQDVNFNGGD YYGGPEPEAG
LALARRIAHI TYRSADELEA RFGRSAQGGE APLQAVSLGD RGRYQVESYL DHQGTKLVRR
FDANSYIAIT EALMSHDVGR GRGPLKDALA QAKAEFFIAA VNTDRLYFPA QSRELAAALP
GDVPVHIIEA PIGHDGFLTE IGQLSAQLRQ NFFA