Gene TM1040_3020 details


Gene Information

Locus tag: TM1040_3020
Symbol:
ID: 4076593
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 3187424
End bp: 3188569
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 61%
IMG OID: 638008349
Product: phosphoserine aminotransferase
Protein accession: YP_615014
Protein GI: 99082860
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG1932] Phosphoserine aminotransferase
TIGRFAM ID: [TIGR01365] phosphoserine aminotransferase, Methanosarcina type; [TIGR01366] phosphoserine aminotransferase, putative
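
The coordinate and length fields in the Gene Information table can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates (the record does not state this) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 3187424
end_bp = 3188569
gene_length_bp = 1146
protein_length_aa = 381


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions, the span from Start bp to End bp gives 1146 bp, which is 382 codons, or 381 amino acids once the stop codon is excluded.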


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.285129
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTATTG CACAACCGGC ATCGCGGCCG GCCAATCCGC GTTTTTCTTC TGGCCCCTGC 
GCCAAACCCC CCACCTACGA TCTTTCCAAA CTCGCTGGCG CGCCTCTGGG TCGCAGCCAC
CGCGCTGCCG TGGGCAAGGA AAAGCTCCTC GCCGCCATCG AAGGCACCCG CGAGGTTCTG
GGCATCCCCG CAGACTACAA GATCGGCATC GTGCCTGCGT CCGACACCGG CGCTGTAGAG
ATGGCGATGT GGAACCTTCT CGGTGCGCGC AAGGTCGAGA TGCTCGCCTG GGAAAGCTTT
GGCGCAGGCT GGGTCACCGA TGTGGTGAAG CAGCTGAAAA TCGATGCCGA GGTCAAGACC
GCCGAGTATG GCGACATCGT TGATCTTCAG ACGGTGGACA CCAACAACGA CGTGGTCTTC
ACATGGAACG GCACAACCTC CGGCGTTCGG GTTCCCAATG GGGACTGGAT CAAAGACGAC
CGCGAAGGTC TGATCATTTG CGATGCGACT TCTGCGGCCT TCGCACAAGA ATTGCCGTGG
AAGAAACTCG ATGTAACGAC GTTCTCCTGG CAGAAGGTGC TGGGTGGCGA GGCCGCGCAT
GGTGTGATCG TCCTCAGCCC CCGCGCGGTT GAACGCCTCG AGAGCTACAC CCCTGCATGG
CCTCTGCCGA AGATCTTCCG CCTGACCAAA GGCGGCAAAC TGATCGACGG GATCTTTACC
GGTGCAACTA TCAACACGCC GTCGATGCTG GCAGTTGAGG ACTATCTGTT CGCGCTCGAT
TGGGCGCGGT CGGTGGGCGG CGTTGAGGGG TTGATTGGTC GGGCCAATGC CAATGCCAAC
GCGATCCACG CCTTTGCCTA TGCCAACGAC TGGATTGAAA ACCTCGCCAA TGATCCGGCG
ACGCGTTCCA ATACCTCTGT CTGTCTGAAG TTCACGGACA AGCGGATCAA GGATGGTGCG
AGCTTTGCCA AGTCTGTGGC CAAGCGGCTG GAGGCCGAGG GCATCGCCTA TGATATCGGT
GCTTACCGTG ACGCGCCTGC CGGCCTTCGG ATCTGGTGTG GCGGCACGGT CGAGACCGCC
GATATCGAGG CTATGCTGCC ATGGCTAGCA TGGGCCTTTG AGGCCGAGAT CGCCGCGCAA
GGCTAA
 
Protein sequence
MAIAQPASRP ANPRFSSGPC AKPPTYDLSK LAGAPLGRSH RAAVGKEKLL AAIEGTREVL 
GIPADYKIGI VPASDTGAVE MAMWNLLGAR KVEMLAWESF GAGWVTDVVK QLKIDAEVKT
AEYGDIVDLQ TVDTNNDVVF TWNGTTSGVR VPNGDWIKDD REGLIICDAT SAAFAQELPW
KKLDVTTFSW QKVLGGEAAH GVIVLSPRAV ERLESYTPAW PLPKIFRLTK GGKLIDGIFT
GATINTPSML AVEDYLFALD WARSVGGVEG LIGRANANAN AIHAFAYAND WIENLANDPA
TRSNTSVCLK FTDKRIKDGA SFAKSVAKRL EAEGIAYDIG AYRDAPAGLR IWCGGTVETA
DIEAMLPWLA WAFEAEIAAQ G
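
The relationship between the gene sequence and the protein sequence above can be illustrated by translating a short stretch of the gene. This is a rough sketch only: it builds the codon table from the usual TCAG base ordering, and it relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in permitted start codons, which do not matter here because the gene starts with ATG). The `translate` helper is hypothetical and written for this example. Only the first and last rows of the gene sequence are used.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied from the record.
first_row = "ATGGCTATTG CACAACCGGC ATCGCGGCCG GCCAATCCGC GTTTTTCTTC TGGCCCCTGC"
# Last row of the gene sequence: one codon followed by the stop codon TAA.
last_row = "GGCTAA"

print(translate(first_row))  # MAIAQPASRPANPRFSSGPC
print(translate(last_row))   # G
```

The first 60 bases translate to the first 20 residues of the listed protein sequence, and the final row gives the terminal glycine followed by the stop codon.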