Gene Tgr7_1531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTgr7_1531 
Symbol 
ID7318018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThioalkalivibrio sp. HL-EbGR7 
KingdomBacteria 
Replicon accessionNC_011901 
Strand
Start bp1640946 
End bp1642265 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content69% 
IMG OID643616422 
ProductN-ethylammeline chlorohydrolase 
Protein accessionYP_002513602 
Protein GI220934703 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAACCA TCGACACCCT CATCAAGGCA CGCTGGATCA TTCCGGTGGA GCCCGACGAC 
ACCGTCCTCG AACACCATGC CCTGGCCATC CGCGCCGGGC GCATCGTGGC GCTCCTGCCC
TCGGCCGAGG CCGATGACCG CTACCGGGCG GACAAGGTCC ATGAACTGCC CCACCACGCC
CTGATCCCGG GGCTGGTGAA CACCCACACC CATGCCGCCA TGAGCCTGAT GCGCGGTCTG
GCCGATGATC TGCCCCTGAT GGAATGGCTC AAGGGCCACA TCTGGCCCGC CGAGGGCCGC
TGGGTGGGCG CGGAGTTCGT GGAGGACGGC ACTCTGCTGG CCATGGCGGA GATGCTGCGC
GGCGGGGTGA CCTGCTTCAA CGACATGTAC TTCTTCCCGG AGATCACCGC CCACGCCGCC
GCGCGGGCGG GCATGCGCGC GGCACTGGGC CTGATCGTCA TCGATTTCCC CACCGCCTGG
GCGGCCAATG CGGATGAGTA CATCGCCAAG GGCCTGGCCC TCTACGACGA CCACAAGGAC
GAGGCCCTGC TGTCATTCTG CTTCGCGCCC CACGCGCCCT ATACCGTCTC CGACGAGCCC
CTCAAGCGCA TCCGGACCCT GGCCAATGAG CTGGACCTGC CGGTACACAT GCACGTGCAC
GAGACCGCCC ACGAGGTGGA AGAATCCATG GCCCGCTTCG GCATGCGCCC GCTAGAGCGC
CTGGCGCAGC TGGGCCTGGT GGGCCCCAAC CTGCTGGCCG TGCACATGAC CCAGCTGGAA
GATGCCGAGA TCGCCCACCT GGCCGAGGCT GGCGCCCACG TGCTGCACTG CCCTGAATCG
AACCTCAAGC TGGCCAGCGG TTTCTGTCCG GTCCAGAAAC TCCTGGATGC CGGGGTGAAC
GTGTGCCTGG GCACCGACGG CGCCGCCAGC AACAACGACC TGGACCTGAT GGGCGAGATG
CGCACCGCCG CGCTGCTGGC CAAGGGGGTG GCCGGGGATG CCGCCGCCCT GCCTGCCGCT
GCGGCCCTGC GCATGGCGAC CTTGAACGGC GCCCGGGCCC TGGGGCTCGG CGAGGAGACC
GGCTCCCTGG TGCCCGGCAA GGCAGCGGAC GTGGTGGCCG TGGATCTCGG CGCCCTCGAA
AGCCGCCCCG TGTATCACCC CGTCTCACAC CTGGTCTATG CCACGGGCCG ACAGCAGGTG
ACCCATGTCT GGGTGGCCGG CAAGGCCCTG CTGAAGGACC GCCGCCTGAC GACCCTGGAC
CTGGAGGCGA TCCAGGCCAG GGCGATGGCG TGGCAGGAGC GCTTGAAGGC AAATGCATGA
 
Protein sequence
METIDTLIKA RWIIPVEPDD TVLEHHALAI RAGRIVALLP SAEADDRYRA DKVHELPHHA 
LIPGLVNTHT HAAMSLMRGL ADDLPLMEWL KGHIWPAEGR WVGAEFVEDG TLLAMAEMLR
GGVTCFNDMY FFPEITAHAA ARAGMRAALG LIVIDFPTAW AANADEYIAK GLALYDDHKD
EALLSFCFAP HAPYTVSDEP LKRIRTLANE LDLPVHMHVH ETAHEVEESM ARFGMRPLER
LAQLGLVGPN LLAVHMTQLE DAEIAHLAEA GAHVLHCPES NLKLASGFCP VQKLLDAGVN
VCLGTDGAAS NNDLDLMGEM RTAALLAKGV AGDAAALPAA AALRMATLNG ARALGLGEET
GSLVPGKAAD VVAVDLGALE SRPVYHPVSH LVYATGRQQV THVWVAGKAL LKDRRLTTLD
LEAIQARAMA WQERLKANA