Gene Phep_1874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1874 
Symbol 
ID8252978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2166379 
End bp2167437 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content43% 
IMG OID644935525 
Producthomoserine O-acetyltransferase 
Protein accessionYP_003092144 
Protein GI255531772 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000788104 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACAA TCTCCATATA TAATTACAAC AAAACCTTTA AACTTGAAAA TGGCAAAAAG 
TTGCGTAAGC TTGAAATTGC CTATCAGACT TATGGTAAAT TAAATGCCAA AAAGGACAAT
GTAATTTGGG CCTGTCATGC ACTTACAGCG AATTCTGATG TGCTGGATTG GTGGAAAGGG
CTTTTTGGCA ACAATGCGCT GTTTAATCCT GATGAACACT TTATCATATG TGCCAATGTA
TTGGGCTCGC ATTATGGCAG CACCAACCCA TTGAGTACCA ATCCGGTAAC TGGTCAGCCT
TATTACCTGG CCTTTCCGGA GTTTACCATC AGGGACCTGG TTGCAGCACA CCGGCTGCTT
GCAGCACATC TGGGGATCAG TACGGTTAAG GTATTGATTG GCGGTTCATT AGGGGGACAA
CAGGCATTGG AATGGGCCAT CACTGATAAC AATGCCATAG AGAACCTTAT TTTAGTGGCC
ACCAATGCCG TACACTCGCC ATGGGGCATA GCCTTTAATG AGAGCCAGCG GCTGTCCATT
ACAACGGACC GTAGTTTTTA TGCACAAAAA CCTGATGGCG GGTTAAAAGG ATTAAAAGTT
GCCAGAAGCA TTGCTTTATT GTCCTACAGG ACCTATGATG CCTATTCGGC TACCCAGCTG
GAAAGTGTGA ACGATAAAAT CGGTAGCTTC AGGGCTTCTT CCTATCAGAA TTACCAGGGA
GAAAAACTCT GTAAGCGTTT CAATGCTTAC AGCTACTGGT ACCTGAGCAA AGCGATGGAC
AGTCATAATG TAAGCAGGAA CAGAAATAGT GTAATTGACG CCCTGGCATT GGTAAAAGCA
AATACCCTGG TAATAGGTAT TGAAAACGAC ATTCTTTTCC CATTGGCAGA ACAGGAGTTC
ATGGCAGAAA ACATCCCTGG TGCAGAATTC CAAAGTCTGA AGTCGGCCTA CGGCCATGAC
GGTTTCCTGA TTGAAACAGA TGCGCTTACA AACGTTATTG GTAATTTCCT TAAAGAGAGC
GTACACAAGA AAATAATTAA ATTACATAAA ACAGCATAA
 
Protein sequence
MSTISIYNYN KTFKLENGKK LRKLEIAYQT YGKLNAKKDN VIWACHALTA NSDVLDWWKG 
LFGNNALFNP DEHFIICANV LGSHYGSTNP LSTNPVTGQP YYLAFPEFTI RDLVAAHRLL
AAHLGISTVK VLIGGSLGGQ QALEWAITDN NAIENLILVA TNAVHSPWGI AFNESQRLSI
TTDRSFYAQK PDGGLKGLKV ARSIALLSYR TYDAYSATQL ESVNDKIGSF RASSYQNYQG
EKLCKRFNAY SYWYLSKAMD SHNVSRNRNS VIDALALVKA NTLVIGIEND ILFPLAEQEF
MAENIPGAEF QSLKSAYGHD GFLIETDALT NVIGNFLKES VHKKIIKLHK TA