Gene GM21_1241 details


Gene Information

Locus tag: GM21_1241
Symbol:
ID: 8136566
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1446701
End bp: 1447945
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 63%
IMG OID: 644868855
Product: amidohydrolase
Protein accession: YP_003021060
Protein GI: 253699871
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
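
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the CDS ends in a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1446701, 1447945)
print(length, protein_length_aa(length))  # prints: 1245 414
```

Both values agree with the Gene Length and Protein Length fields in the record.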


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 2.29686e-19
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAATAT ACGCCGCGTC ATATCTGCTT CCCATCTCCT CTCCCCCCAT CGCCGGCGGC 
GCCGTCGCGG TGGAGAACGG AGTGATAGCC GCCGTCGGAA CACTGCCCGA GGTCTCCACC
GTCTGCGGGG CACCGGTCAC CGATCTGCCG GGGTGCGTCA TCATGCCCGG GCTGGTCAAC
GCCCACACGC ACCTGGAACT GACCCATTTC CCCGCCTGGA AGCTGCGCAA GGACCTCGAC
TATCTCCCCA AGCGCTACGT GGAATGGATC CAGCAGGTGG TGAAGATCAA GCGCGCCCTT
TTGCCCGGGG AGATGGAGCA TTCGATCCGG GAAGGGATCC GCCTTTGCCT TGAATCCGGC
ACCACCTCCG TCGGCGACAT ACTCTCCGAC TTTTCCTTGG CCCCTCTCTA CCTCGACACG
CCGCTTGCCG GCAGGGTGTT CCTGGAGGCG ATAGGGCACG ACCCCACGCA GGGGGAAAAC
CTGTTGCGGC GGATCGAGAC GACGCTCGAC ATCTTCGCGG GGAGCATCCT GCCGGCGATC
TCTCCGCACA CCCCCCACAC CGTGTCGTCG CAGCTTTTGC AGGCCTTGCA CGCTCTGGCC
GTAAGCCGTG CCATACCGAA GGCAATCCAC CTCTCGGAAA CCGCGGACGA AGCCTCCTTC
ATGCATGACA CCACCGGGGA GATCGCCGAA CTCATCTATC CCATGGCGCA CTGGGAAGAG
TATCTGCCGC ACCCGATGTA CACCACCTCC ACCCGTTTCC TTTGCGATCT TGGCGTCCTC
GACCGCTCCA CCCTTGCCGT CCATGCCGTG CACGTCACCA TGGACGACGT GAGACTGTTA
AAGGAAAAGG GTTGCAGCGT GGTCCTCTGC CCTCGCAGCA ACGACCGGCT TTTCGTCGGC
ACCGCACCGC ACAAGCTCTT GAAGAAGGCC GGAGTTCCGC TCGCCCTGGG GACCGACTCC
CTGGCGAGCA ACGACTCCCT TTCTCTTTGG GACGAGGTGC GCTACCTGCA GCAGCAGGCA
CAAGGCGTCT TCAGCGCCGA AGAACTCATC GCCATGGCGA CCATAGGGGG AGCCCGGGCC
TTGCAGATAG AGGCGAGCGC CGGTTCCTTG GAGCCAGGCA AGCGCGCCGA CTTCCAGGTT
CTTTCCTTGG GCAGCGTCAG TGAGACTTCC GTCCACGCCG CCCTTCTGTC CAAGGGGCGT
CTGGAGCAGG TCTACGTCGC CGGCGAGAGG TACCCGAAAC AGTAG
 
Protein sequence
MKIYAASYLL PISSPPIAGG AVAVENGVIA AVGTLPEVST VCGAPVTDLP GCVIMPGLVN 
AHTHLELTHF PAWKLRKDLD YLPKRYVEWI QQVVKIKRAL LPGEMEHSIR EGIRLCLESG
TTSVGDILSD FSLAPLYLDT PLAGRVFLEA IGHDPTQGEN LLRRIETTLD IFAGSILPAI
SPHTPHTVSS QLLQALHALA VSRAIPKAIH LSETADEASF MHDTTGEIAE LIYPMAHWEE
YLPHPMYTTS TRFLCDLGVL DRSTLAVHAV HVTMDDVRLL KEKGCSVVLC PRSNDRLFVG
TAPHKLLKKA GVPLALGTDS LASNDSLSLW DEVRYLQQQA QGVFSAEELI AMATIGGARA
LQIEASAGSL EPGKRADFQV LSLGSVSETS VHAALLSKGR LEQVYVAGER YPKQ
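
The sequences above are laid out in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up, together with a GC-fraction calculation and a stop-codon check. The helper names are hypothetical, and the set of stop codons (TAA, TAG, TGA) is the one used by translation table 11. The short inputs shown are the first two and the last two blocks of the gene sequence, not the full record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


head = clean_sequence("ATGAAAATAT ACGCCGCGTC")  # first two blocks of the gene
tail = clean_sequence("TACCCGAAAC AGTAG")       # last two blocks of the gene
print(len(head), gc_fraction(head))  # prints: 20 0.45
print(ends_with_stop(tail))          # prints: True
```

Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field in the record.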