Gene Moth_0019 details

Gene Information

Locus tag: Moth_0019
Symbol:
ID: 3831892
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 18887
End bp: 20053
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 61%
IMG OID: 637827946
Product: Serine--glyoxylate transaminase
Protein accession: YP_428902
Protein GI: 83588893
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0075] Serine-pyruvate aminotransferase/archaeal aspartate aminotransferase
TIGRFAM ID:
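
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a complete CDS whose terminal stop codon is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Start bp and End bp as listed in the record
length_bp = gene_length(18887, 20053)
print(length_bp, protein_length(length_bp))
```

Under these assumptions, the listed coordinates give 1167 bp and 388 aa, which agrees with the Gene Length and Protein Length fields.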


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000322298
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000185369
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACTGACA AGCAGATCCT GCTCTTGCCC GGGCCGACGC CGGTGCCGCC GCAGGTGGCC 
CTGGCCATGG CACGTCCGGC GATAAACCAC CGCGGCCCGG AGTTTAAAGC CCTGTGGGCG
GAAGTTACCT CGGGGTTAAA GGACGTTTTC CAGACCCGCG CGGAGGTGGT GATTTTAACC
GCTTCCGGTA CAGGTGGCAT GGAAGCTGCC GTAGCCAATC TCATTTCCCC CGGTGAGAAG
GTGCTGGTCG TGACCATCGG CGCCTTTGGC GAGCGCTTCG TCCAGATCTG CCGGGCCTTT
AACGTGGAGG CGGAGGTCGT AGCCTTCCCC TACGGCCAGG CTGCCGACCC GGAGGTTATA
GCAGAGCGTC TGGCAGCCGA CACCGGGCAT GAGATTAAAG CCATCCTGGT CCAGCATAAC
GAGACCTCGA CAGGAGTTTT AAACGATATC CAGGCTATTA GCCGTGCCCG GGGGGATCAT
CCGGCTTTGC TTATCGTGGA CAGCATCAGC GGCCTGGCGG CGGCTGATTT GCCCATGGAC
GCCTGGCATA TCGATGTGGT TATCGCCGGT TCCCAGAAAG CCTTTATGCT GCCCCCGGGA
TTAACCATGC TGGCTGTGGG CGAGCGCGCC TGGCAGGCGG CTGAGAAATG CTCCAACCAA
CGTTTTTACC TGGATATTAA AAAAGCAAGA AATTCGGGCC TGAAGGGCCA GACGCCCTTT
ACCCCGGCCG TTCCCTTGCT ATATGGTTTA CAAGAATCCC TGCGGCTGCT AAAGGCCGAG
ACCCTGGCCG GCAGCTATGC CCGTCACGCT TTGATGCGGG ACATGGTGCG GGCCGGGGTT
CGCGCCCTGG GCCTGAAGCT CCTGGCCGAC GAGGCAATAG CCTCGCCGGC GGTGACCGCT
GTCTGTGTCC CAGAGGGGAT GAAACCGGCG GATATAATCA ATCCCCTGCG GGAAAGATTT
GGCGTGGTCG TGGCCGGGGG CCAGGGAGCC GTTAAAGACC AGGTCTTCCG CATCGGCCAC
TTAGGGTATG TGAGCTTTAA CGCCATCCTG GCCGGACTGG CCGCTCTGGA GGCCGTTCTG
GCCGACGCCG GGGTACCGGT GACCCGGGGT GCGGCAGTGG CGGCAGCCAG TACTATTTTA
AGTGAAAGTG AGGCTGTAGA TAAGTAA
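
A GC content figure like the one in the record can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence. How the database actually computes and rounds its value is not stated in the record, and `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; spaces and line breaks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence, used here only as a small input
print(gc_content("GTGACTGACA"))
```

Passing the whole gene sequence, with its 10-base blocks and line breaks left in place, gives the figure to compare against the GC content field.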
 
Protein sequence
MTDKQILLLP GPTPVPPQVA LAMARPAINH RGPEFKALWA EVTSGLKDVF QTRAEVVILT 
ASGTGGMEAA VANLISPGEK VLVVTIGAFG ERFVQICRAF NVEAEVVAFP YGQAADPEVI
AERLAADTGH EIKAILVQHN ETSTGVLNDI QAISRARGDH PALLIVDSIS GLAAADLPMD
AWHIDVVIAG SQKAFMLPPG LTMLAVGERA WQAAEKCSNQ RFYLDIKKAR NSGLKGQTPF
TPAVPLLYGL QESLRLLKAE TLAGSYARHA LMRDMVRAGV RALGLKLLAD EAIASPAVTA
VCVPEGMKPA DIINPLRERF GVVVAGGQGA VKDQVFRIGH LGYVSFNAIL AGLAALEAVL
ADAGVPVTRG AAVAAASTIL SESEAVDK
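
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below assumes the codon assignments of translation table 11, which match the standard code. It also assumes an alternative start codon such as the leading GTG of this gene is read as methionine. The start-codon set and the `translate` helper are stated here as assumptions, not taken from the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...)
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumed initiation codons for table 11; each is translated as M when first
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(cds: str) -> str:
    """Translate a CDS, ignoring whitespace and dropping one trailing stop."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"  # initiator codon is read as methionine
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First line of the gene sequence (60 bases, 20 codons)
first_line = ("GTGACTGACA AGCAGATCCT GCTCTTGCCC "
              "GGGCCGACGC CGGTGCCGCC GCAGGTGGCC")
print(translate(first_line))
```

The first line of the gene sequence translates to MTDKQILLLPGPTPVPPQVA, which matches the first two blocks of the protein sequence above.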