Gene Csal_1708 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1708 
Symbol 
ID4028816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1940710 
End bp1941879 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content67% 
IMG OID637966896 
Productpeptidase M20D, amidohydrolase 
Protein accessionYP_573759 
Protein GI92113831 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.971842 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACCG TCACCCCCAC ATTGCTGCGC GAATGGCGCC ACGAGTTTCA TCGCCGGCCG 
GAGACCGCGT TCGAGGAACA TCACACCAGC GCGCGCATCG TCGAGATTCT CGAGGACGCC
GGTATCGAGA AGGTCACCGG CCTCGGCGGC GGCACCGGTG TCGTCGCCTG GGTGGACGGT
CGACATGGCG GCGAGCGCGC CATCGGGCTG CGCGCCGATA TCGACGCCCT GGACGTGCTC
GAAGCCAACG ACGTTCCTCA TGCCTCGACG ACGCCCGGCA AGATGCACGC CTGCGGGCAC
GACGGCCATA CCACCATGCT GCTGGGCGCG GCCTGTGCCC TCGCCGAGGC GCCCGACTTT
GCCGGCCGGG TGTACTTCAT CTTCCAGCCG GCGGAAGAAA ACGAAGGCGG CGGACGCGTC
ATGGTCGAGG AAGGCCTGTT CACGCGTTTC CCGATGGAAG CCGTCTACGG CGTGCACAAC
TGGCCGGGCC TGGCGGTCGG CGAAGCCGCC GTCCATGACA CGGCGGTCAT GGCGGCCTTC
GATGTCTTCC GCGTGAAGCT CACGGGGCAC GGCTGTCATG CCGCCATGCC ACACCTGGGC
AAGGATGTGG TACTGGCGGC CTGCCAACTG GTCAATCAGC TGCAGGGCAT CGTCAGCCGG
GAAACCCCGG CGCACCAGAC CGCCGTGATG AGCGTGACCC AGTTCCATGC CGGGGATGCC
TACAACGTCA TGCCCGAAAC CGTGGAGCTG TGCGGCACCG TGCGCTGTTT CGACCCCGAG
CTGCGCGACC ACCTCGAAAC GCGTTTTCGG CAGGCGATCG CGGCCATGGC CACCTTCCAT
GGCCTGGAGG CCGACATCGA CTACCAATCG CGCTACCCGG CCACCTTCAA CACCCCCGCG
CACGCCGCGC GCTGTGCGGA GGTGCTGGAG ACGCTGCCGG ACATTCACCG GGTGCATCGC
GACCTGCCGC CCTCCATGGC ATCGGAGGAC TTCGCCTTCA TGCTCCAGCA GCGCCCCGGC
GCCTATATCT GGCTGGGCAA CGGCGAGGAC AGCGCGTCGC TGCACAACCC GCATTACGAC
TTCAACGATG CCCTGGCGCC CATCGGGGTG GCGTATTGGG CGGCGCTGGC GAGAACACTA
CTCGACAACG GTGAACGAGA CGCGCCCTGA
 
Protein sequence
MTTVTPTLLR EWRHEFHRRP ETAFEEHHTS ARIVEILEDA GIEKVTGLGG GTGVVAWVDG 
RHGGERAIGL RADIDALDVL EANDVPHAST TPGKMHACGH DGHTTMLLGA ACALAEAPDF
AGRVYFIFQP AEENEGGGRV MVEEGLFTRF PMEAVYGVHN WPGLAVGEAA VHDTAVMAAF
DVFRVKLTGH GCHAAMPHLG KDVVLAACQL VNQLQGIVSR ETPAHQTAVM SVTQFHAGDA
YNVMPETVEL CGTVRCFDPE LRDHLETRFR QAIAAMATFH GLEADIDYQS RYPATFNTPA
HAARCAEVLE TLPDIHRVHR DLPPSMASED FAFMLQQRPG AYIWLGNGED SASLHNPHYD
FNDALAPIGV AYWAALARTL LDNGERDAP