Gene Csal_3107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_3107 
Symbol 
ID4028748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp3461834 
End bp3463444 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content67% 
IMG OID637968321 
Productprotein of unknown function DUF513, hemX 
Protein accessionYP_575150 
Protein GI92115222 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2959] Uncharacterized enzyme of heme biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.52248 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAAC GACAGGACCA GGATGACAAG CAAACGCCCG CCGAGGGGGC GACGTCCAGC 
GATGAGACGT CCGAGTCGCC GGCCACCCGT GACACGACGT CGGGCAAAGC CGATACGGCG
GACCGAGTAG CCGACGCATC GCACGGTGCC TCGGACAAGG CGGCGGAGGA CGCTGCGTCG
TCCAAGGCAT CGTCGAGTGA CGCGTCGACG GTCTCGACGT CGGGAGATAC GGCATCCTCG
AGTGACGAGA CGCCCGCCTC CAAGACGTCT ACCTCCTCCC GCTCAGCGTC CCCCACACAG
CGCGATGCCC AGGCCGCGCG TGACACGTCG ACCCAGTCGA GCACGCAGGG CAAGACGGCC
TCGTCGTCCT CGAAAAGCGC CACGCGTGCT GCGTCCAAGC GCACAGGCAA GGCTGCCGGA
GGGAGCTCTT CCGGGGCGGC GTCCACCGCC GGCGCCAGTG CGTCGGGCAA GCCGAGCGCG
AGCAAGGCTG AGGCGAGCCA GGGCACGCCG CCGCCCGGTG GTGCGTCGAC GACTCCATCG
CGAGGCGGCA ATGGCAGTGG GCGCGGCTTG GCGATCCTGG CTCTGGTGAT TGCCGTGATC
GTGGCCATCG TCGTGGCGCT GGGGCTGGGG TATGGCTGGA AGCGTCTGCA GGCACAGCAG
GCCGAGCTGG CCAGCGCGGG CGAGGAGAAT GCACAGACCA TCGCGACGTT GCGTGATCGG
CTCGATCAGC GTGAGCAGGC CTTCGCGTCG TTGCGCGACG ATTTTGCGTC GTACCGCCAG
GATGTCGACG ACAACCTGGA CAAGGTGCTG GCCGAGCTGG CTGAGGAGCA GGACGCCGAT
CCTCGCGAGT GGTTGCATGC CGAGGCCGAA TACCTGCTGC GACTGGCCAA CCAGCGGCTG
CGCCTGGAGC GTGACGTGCA GGGCGCCCAG GCCTTGCTCG AGGCCGCCGA CCAGCGGCTT
GCCGAGGCGG ACAATCCGGC CCTGATCCCG GTCCGCCGGG CGATCCAGTC CGAGCTTGCG
GCACTGGATT CGGTGCCTCG GGTGGATCAG ACGGGGCTGT ATCTGGCGCT GATGGCACAG
CAGGAGCAGC TGGCGACATT GCCGCTCAAG CAGGACGTGG AAGAGATCGC GGCTCAGAAC
GCCGACGACT CGCCGCTGGA AGGTGGCTGG CGCGAACAGC TCGCGCGGCT GGGCGGCGAG
CTCAAGGATC TCGTCGTCGT GCGGCGTCAT GACGAGGCGC TGGAAGCCCT GATGACGCCC
GAGCAGGAAT CGTATCTGCG TCAGAACGTA CGGCTGCTGC TCGAGCAGGC GCAACTGGCC
CTGCTCAAGA CCGAACCCAA GCTCTATCGG GCCAGTCTCG ACAAGGCCAC GACCCTGATC
GAGCGCTATT ACGATACCGA GCGCGAGGCA GTGACGACCT CCCTCGAGCG TCTCCGGTCG
ATGCGCGACA AGACGATCCG TCCCGAATTG CCGGATATCA GCGAGTCTCA GCAGACGCTC
AAGGATTTCA TCGAGAAGCG CTTCCAGGAC AATGGCAGTG ACGGCAACGC GACCCGGCAG
GGTGCCGGCG ATGCCACGGC CCCATCGGGC GATGAAGGAG ACAGCGCATG A
 
Protein sequence
MSKRQDQDDK QTPAEGATSS DETSESPATR DTTSGKADTA DRVADASHGA SDKAAEDAAS 
SKASSSDAST VSTSGDTASS SDETPASKTS TSSRSASPTQ RDAQAARDTS TQSSTQGKTA
SSSSKSATRA ASKRTGKAAG GSSSGAASTA GASASGKPSA SKAEASQGTP PPGGASTTPS
RGGNGSGRGL AILALVIAVI VAIVVALGLG YGWKRLQAQQ AELASAGEEN AQTIATLRDR
LDQREQAFAS LRDDFASYRQ DVDDNLDKVL AELAEEQDAD PREWLHAEAE YLLRLANQRL
RLERDVQGAQ ALLEAADQRL AEADNPALIP VRRAIQSELA ALDSVPRVDQ TGLYLALMAQ
QEQLATLPLK QDVEEIAAQN ADDSPLEGGW REQLARLGGE LKDLVVVRRH DEALEALMTP
EQESYLRQNV RLLLEQAQLA LLKTEPKLYR ASLDKATTLI ERYYDTEREA VTTSLERLRS
MRDKTIRPEL PDISESQQTL KDFIEKRFQD NGSDGNATRQ GAGDATAPSG DEGDSA