Gene GM21_2238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2238 
Symbol 
ID8137577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2612243 
End bp2613496 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content63% 
IMG OID644869853 
ProductO-acetylhomoserine aminocarboxypropyltransferase 
Protein accessionYP_003022045 
Protein GI253700856 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value0.429197 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGGGAGA AAGAAAAAAA ACTGAGATTC AACACCAAAC TGATCCACGG CGGCACCTCG 
CCCGGGCCCT CCGGCGCCAC CAAGACCCCG ATAGTACAGG CATCCGCCTT CGCTTACGAC
ACAGCGGAAG CGCTGGAAGA CATCTTCAGG GGAAGAGCAG TGGGCCAGGT GTACACTAGG
ATAGGCAACC CCACCATGGA CACCCTGGAG AAGAGGCTGG CCGCGATCGA GGACGGCATC
GCAGCCGTCG TCACCTCTTC TGGCATGGCG GCGATCACCA CCGCGGTCAT GGGAGTGGTA
AGAAGCGGCG ACGAAGTCCT CTCCTCATCC TCGCTTTTCG GGGGGACCTA CTCGCTTTTC
CACGACACCC TGGCCAACTT CGGGATCTAC ACCCGCTTCG TCGACCCTGT CGACCTTGCC
GCGGTCGAGG CGGGGATCAA CGACAAAACC CGACTGATCT TCGTGGAAAC CATCGGCAAC
CCGAAGATGG ACGTTCCCGA CATAGCCGCT TTCGCAGCCA TCGCCAGGAA ACACGGCATA
CCGCTCATGG TCGACGCCAC CGTTTCCACG CCGTATCTCG CGCGGTCCAA GGAGCTCGGG
GCCGACATCA TCGTCCATTC CACCAGCAAG TACATAAACG GCACCGCCAA CTCCATCGGC
GGCGCCATCA TAGACGCAGG GAGCTTCAAC TGGCAGAGCC CGAAGTTCCC GCACTTCGAG
GAGTTTTACC GCAGGTACCG CGGCTTCGCC TTCACGGCAC GGGTCCGCAA GCTGATCCAC
AAGGATTTCG GCGCCTGCGC CGCGCCGCTC AACTCCTTCC TTTTGGGCGA GGGGCTGGAG
ACGCTCTCCC TGCGCATGGA GCGGCACTGC GCCAACGCCC TCCAAGTGGC CCGCTTCCTT
CAGGCGCACG AAAAGGTCGC CTGGGTCAAC TACCCCGGCC TCGACGACTC CCCCTTTCAC
GAGGTGGCGA AGCGCCAGTT CGACGGCCGC TTCGGCGGGC TCCTGACCTT CGGACTCGCG
GACAGGGCCG CCGCCTTCCG GGTCATCAAC AACCTGCGGC TGGCCAAGAA TCTCGCCAAC
ATCGGTGACA CCAAGACCCT GGTGATCCAC CCGGCGAGCA CCATCTGCGC CGATTACACC
CCCGAGGTGA AGGCGCTCAT GGGAGTGAGC GAGGAGCAGG TCAGGGTCTC GGTGGGTATC
GAGGACATCG AGGATATCCT GGAGGATTTT GCGGCCGCGC TGGAAGAGGC CTGA
 
Protein sequence
MGEKEKKLRF NTKLIHGGTS PGPSGATKTP IVQASAFAYD TAEALEDIFR GRAVGQVYTR 
IGNPTMDTLE KRLAAIEDGI AAVVTSSGMA AITTAVMGVV RSGDEVLSSS SLFGGTYSLF
HDTLANFGIY TRFVDPVDLA AVEAGINDKT RLIFVETIGN PKMDVPDIAA FAAIARKHGI
PLMVDATVST PYLARSKELG ADIIVHSTSK YINGTANSIG GAIIDAGSFN WQSPKFPHFE
EFYRRYRGFA FTARVRKLIH KDFGACAAPL NSFLLGEGLE TLSLRMERHC ANALQVARFL
QAHEKVAWVN YPGLDDSPFH EVAKRQFDGR FGGLLTFGLA DRAAAFRVIN NLRLAKNLAN
IGDTKTLVIH PASTICADYT PEVKALMGVS EEQVRVSVGI EDIEDILEDF AAALEEA