Gene Tcr_1839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcr_1839 
SymbolmetX 
ID3761052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiomicrospira crunogena XCL-2 
KingdomBacteria 
Replicon accessionNC_007520 
Strand
Start bp2013967 
End bp2015124 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content46% 
IMG OID637786583 
Producthomoserine O-acetyltransferase 
Protein accessionYP_392105 
Protein GI78486180 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0107828 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGATG AAATTGGTAT AGTAACCCCT CAAAAACTCC ATGTTTCGAC CCCTCTTGAG 
ATGGTCAGTG GCTCCACTTT GCCTGAATAT GACCTGGCCT ACGAAACCTA CGGCAGCCTA
AATGCCGATA AAAGTAATGC CATTTTAATT TGCCATGCAT TAAGTGGAAA CCATCATGTG
GCCGGTCAAT ATGAAGGGGA ATCAACCAGA GGTTGGTGGG ATGGCTATAT TGGTCCAGGG
AAACCGATCG ATACCAATCG TTTTTTTGTG GTCTGCTCCA ATAATCTAGG CGGTTGCCAT
GGTTCAACAG GTCCTGCCAG TATCAACCCC CTAACCGGAA AAGTGTACGG ACCTGACTTT
CCGATTGTGA CCTGTAAAGA TTGGGTACAC AGCCAAAACA CGCTGCGTCA ACATTTAGAA
ATCGATGCCT GGGCGGCCGT CATTGGGGGA TCAATGGGCG GCATGCAAGT TTTACAATGG
ACCATCGACT TTCCCGATCA AATTCGTCAT GCCATTGTGA TTGCCTCTGC ACCTAAATTA
TCGGCACAAA ACATTGCATT CAACGAGGTC GCACGTCGTG CCATTATGAC CGACCCCGAC
TTTCATGACG GTCGCTTTAT CGAAGCCGGC ACCACGCCGA AAAGAGGATT GGCTTTAGCT
CGAATGCTAG GACATCTAAC CTATTTATCC GATGATATGA TGGGTTCAAA ATTCGGTCGT
GAACTGCGAG AGGGCAAACT TAATTATAAC TTTGATGTGG AATTTCAGGT TGAGAGCTAC
CTTCGCTACC AGGGTGAAAA GTTTGCAACA AAACAAAACT TTGACGCGAA CACCTATTTA
CTAATGACCA AAGCGTTGGA TTATTTTGAC CCTGCCGCCG ACTTTGATGA TGATCTATCC
AAAGCCCTTT CTGGCGCAAC GGCAAAATTT TTGATCATTT CATTTACCAC TGATTGGCGT
TTCTCCCCTG AGCGATCACA TGAAATCGTT AAGGCCTTAC TCGATAACGA TGCCGACATC
AGTTATGCCG AAGTGAATTC ACAGCATGGA CATGATGCCT TTTTATTGCC GAATGACCAT
TATGAAGGTG TTTTTCGTGC CTATATGAAA CGAATTCATG CCGAATTAAA CCACACCTCT
TTGCAGGAAG GAGAATAG
 
Protein sequence
MTDEIGIVTP QKLHVSTPLE MVSGSTLPEY DLAYETYGSL NADKSNAILI CHALSGNHHV 
AGQYEGESTR GWWDGYIGPG KPIDTNRFFV VCSNNLGGCH GSTGPASINP LTGKVYGPDF
PIVTCKDWVH SQNTLRQHLE IDAWAAVIGG SMGGMQVLQW TIDFPDQIRH AIVIASAPKL
SAQNIAFNEV ARRAIMTDPD FHDGRFIEAG TTPKRGLALA RMLGHLTYLS DDMMGSKFGR
ELREGKLNYN FDVEFQVESY LRYQGEKFAT KQNFDANTYL LMTKALDYFD PAADFDDDLS
KALSGATAKF LIISFTTDWR FSPERSHEIV KALLDNDADI SYAEVNSQHG HDAFLLPNDH
YEGVFRAYMK RIHAELNHTS LQEGE