Gene Csal_1047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1047 
Symbol 
ID4027835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1180318 
End bp1181430 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content63% 
IMG OID637966224 
Productaminopeptidase DmpA 
Protein accessionYP_573103 
Protein GI92113175 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3191] L-aminopeptidase/D-esterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0633499 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCGAA TGAGAGCACG CGAGATGGGG CTGCCGCTGC CCGGCGAACC GGGCTTGAAC 
AACGCGATTA CGGACGTGCC GGGCGTGTTG GTCGGCTATG AAACGATTGA TGGAACGGCC
GAAAACGGAC GGCCCATCAA GACCGGCGTG ACCGCCATTC TGCCGCGCAC CCGAAGCAAC
ACGCCTGCTC CTGTCTGGGC CGGCTTTCAT GCGCTCAACG GCAACGGGGA AATGACAGGG
ACGCATTGGA TCGAGCAAGG CGGCTATTTC GTCGGCCCGA TATGCCTGAC CAATTCGCAT
AGCGTGGGCA TCGTTCATCA CGCCGCGACA CGCTGGATGC TCGATACCTA CGCGCAGACT
TTCGACGCAC ACCACCTGTG GGCAATGCCC GTGGTCGCGG AGACCTATGA CGGCGTACTC
AATGACATCA ACGGTCAGCA CGTGGAAGCC GCCCATGTCC ACGCCGCCCT CGCCAGCGCA
AGTGGCGGCG CCATAGAAGA GGGAAACGTC GGCGGCGGCA ACGGCATGAT CTGTTACGGC
TTCAAGGGCG GTACGGGTAC CGCTTCGCGC CGTGTCGGCA TCGATGGACA GGACTACACG
TTGGGCGTGC TTGTCCAGGC CAACCACGGC AAGCGGGACT GGTTGAACGT ACTAGGCGTG
CCTGTTGGAG AGGCACTGCA TGATGCCGAC TTGCCCGAAG AGCTCAATCG CGAACGCGGC
TCCATCATCG CCGTCATCGC GACAGACGCT CCCATGCTGC CCCATCAGCT CAAGCGCCTC
GCCCAACGCG CCGGACTGGG TATCGCACGC TCCGGCAGCC CCGGCGGCAA CGATTCAGGC
GATATGTTTC TGGCCTTCAG CACGGCGAAC GAGGGTCCCT TGCCCCAGCT CGGGCCGGCC
CGGCAACAGA TGCACCACAT GAACGACGAG TATTTCGATG ACTTCTACAT GGCGGTCGTG
CAAGCGACGG ACGAAGCCGT CCTCAATGCC ATGTGCATGG CCAGGGGAGC GCCCATGGCA
AAGCCGGAGG GCTGGTGCCC AGCTCTCGAT CCGGAACGGC TCGAGCCGTT ACTACGCCGG
GCCGGTATCA GCATAGGAGA ACGTAACGAT TGA
 
Protein sequence
MQRMRAREMG LPLPGEPGLN NAITDVPGVL VGYETIDGTA ENGRPIKTGV TAILPRTRSN 
TPAPVWAGFH ALNGNGEMTG THWIEQGGYF VGPICLTNSH SVGIVHHAAT RWMLDTYAQT
FDAHHLWAMP VVAETYDGVL NDINGQHVEA AHVHAALASA SGGAIEEGNV GGGNGMICYG
FKGGTGTASR RVGIDGQDYT LGVLVQANHG KRDWLNVLGV PVGEALHDAD LPEELNRERG
SIIAVIATDA PMLPHQLKRL AQRAGLGIAR SGSPGGNDSG DMFLAFSTAN EGPLPQLGPA
RQQMHHMNDE YFDDFYMAVV QATDEAVLNA MCMARGAPMA KPEGWCPALD PERLEPLLRR
AGISIGERND