Gene Csal_0201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0201 
Symbol 
ID4027166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp220634 
End bp221989 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content65% 
IMG OID637965352 
Productpeptidase 
Protein accessionYP_572264 
Protein GI92112336 
COG category[S] Function unknown 
COG ID[COG3182] Uncharacterized iron-regulated membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCGTA CTTCTGCGCG TTCTTCACGT GACGCCAGCG ACCTCTATCG GGCGGTCTGG 
CGCTGGCATT TCTACGCGGG TCTCATCGCG ATTCCCTTTC TGATCTCGTT GGCGGTGACC
GGCGGGCTCT ACCTCTTCAA GGATGAGATC GACCAGTGGC TCAACGCGGA CATCGTGCGC
GTCGAGGCGC AATCGTCAGC GGCGGTGTCG CCGCAGCAAC AGCTGGATGC GGCCATGGCG
GCGCATCCCG GCGAGGCGTT TCGTTACGTG CCGCCGGCCG CCTCCAATCT TGCCGCGGAG
GTCGACATCA CCACCGCCGA CGGCAAGCAA GCCGTCTACG TCGACCCTTA CACCGGCGAG
GTCACTGGCA CGATTCCCTA TCGCGGCAGC GTGATGTGGA TCGTGCGCAC CATCCACAGC
CTCAGCTACT TCGGCGAGAC GGCGAGTCTG ATCATCGAGA TCGTCGGCGG CTGGTCGATC
CTGCTCGTGC TGACCGGAAT CTATCTGTGG TGGCCGCGCG GTCGACGCGG CGGTGTGATG
ACGGTACGTG CGACCCCGGC GAAGCGATTG TTCTGGCGTG ACCTTCACGC CGTCACCGGC
ATCTTCGTCG GGGGCTTCAT TCTTTTTCTG GCGATGACCG GCATGCCGTG GTCCACGCTG
TGGGGCAGCA AGGTCAACGA ACTCGCCAAC GGTCACAACT TCGGCTATCC GGATGGCGTG
CGCGTCAACG TCCCGGTCTC CGACGAACGC CTGGCGGAGC GGGAGATGAC TACCTGGTCG
CTGGAGCAGG CGCGGCTGCC GGAATCGACG CCTGGGCGTG AGGGCGCGCC GGGCATCGGC
CTCAACGGCG CGGTGAAGGC GTTCGACGCG CTGGGGCTGG CCCCAGGATA TGCCGTCAGC
CTGCCGAGCA GTCCTACCGG CGTCTATACC GGCTCGATCT ACCCCGACGA TCTTTCACGA
CAGCGGGTCG TGCATCTGGA CCGATATAGC GGCGAGCCAC TGCTGGACAT GAGCTACGCC
GACTATGGCC CGTTGGGCAA GTCGCTGGAG TGGGGCATCA ACGTGCACAT GGGCCAGCAA
TATGGGCTCG CCAATCAGTT GATTCTGGCG CTGGCCTGCG CGGGGATCGT GCTGCTGTGC
GTCTCGTCGG GCGTGATGTG GTGGAAGCGT CGACCGAGCG GCAAGCTGGG GATTCCCCCC
GAACCCAAGG ATCCCCGTCG CTTGCGCGGT GTGCTGGCGT TGCTCGCCAT CGGCGGCGTG
ATCTTCCCGC TGGTGGGGGC CTCGATGATC GTCATGGCGG TGGTGGATGC GCTGGTACGC
CGTCGCGGCG CCAGGCGCGC CGCGACGACC GCCTAG
 
Protein sequence
MSRTSARSSR DASDLYRAVW RWHFYAGLIA IPFLISLAVT GGLYLFKDEI DQWLNADIVR 
VEAQSSAAVS PQQQLDAAMA AHPGEAFRYV PPAASNLAAE VDITTADGKQ AVYVDPYTGE
VTGTIPYRGS VMWIVRTIHS LSYFGETASL IIEIVGGWSI LLVLTGIYLW WPRGRRGGVM
TVRATPAKRL FWRDLHAVTG IFVGGFILFL AMTGMPWSTL WGSKVNELAN GHNFGYPDGV
RVNVPVSDER LAEREMTTWS LEQARLPEST PGREGAPGIG LNGAVKAFDA LGLAPGYAVS
LPSSPTGVYT GSIYPDDLSR QRVVHLDRYS GEPLLDMSYA DYGPLGKSLE WGINVHMGQQ
YGLANQLILA LACAGIVLLC VSSGVMWWKR RPSGKLGIPP EPKDPRRLRG VLALLAIGGV
IFPLVGASMI VMAVVDALVR RRGARRAATT A