Gene Csal_1712 details

Gene Information

Locus tag: Csal_1712
Symbol:
ID: 4028820
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 1948041
End bp: 1949201
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 62%
IMG OID: 637966900
Product: polysaccharide export protein
Protein accession: YP_573763
Protein GI: 92113835
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1596] Periplasmic protein involved in polysaccharide export
TIGRFAM ID:
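
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1948041
END_BP = 1949201
GENE_LENGTH_BP = 1161
PROTEIN_LENGTH_AA = 386


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one for the stop codon (unspliced CDS assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1161
print(expected_protein_length(GENE_LENGTH_BP))  # 386
```

Under these assumptions both results agree with the listed gene length (1161 bp) and protein length (386 aa).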


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.69321
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTTTAC GCCGCAACGA GTCTCGTTCG GGATGGAGAA AGCCACTGCT GGGGCTTTCT 
TTCATGGTGG CGCTCTCGGG CTGTGCGTTC GCACCCGGTG GGCATATCGA TTATGACACG
CAGGGGGAGG ACCTTTCCGA GAACATCGAG GTCAAGCCGA TCACGCCGAG CCTGGTCAAG
ACGATGGCGG CGAGCGGGGA TGAGCTGCGC GAGTCGCTGT TCGAGTATGC CGAGAATGCC
GAGCCGCAGA TGGAGGATCT CGATTACGAC TACATGATCG GGCGCGGGGA CGTGCTGGCG
GTCGTCGTGT ATGACCATCC CGAACTGACG ATTCCCGCCG GTAGCGAACG CAGTGCGGAA
GAGTCGGGCA ACGTGGTGCA CTCGGACGGC ACCATCTTCT ATCCCTACAT CGGTACGGTG
GATGTCGCTG GACGCACCGT GCGCGATGTC CGCAGCGAGA TTCAGCGCCG CCTCGAAGGC
TACATCGCTC AGCCTCAGGT GGACGTGAAG GTCGCCGCCT TCAATGCGCA GAAGGCGTAC
GTCACCGGCC AGGTCGAACG TGCCGGTGCC CAGCCGATCA CCAACATTCC GCTCACCGTG
CTGGATGCCT TGAGCAACGT GGGGGGGCTG ACCCAGGGCG GCGACTGGCA TGACGTCGTG
CTCACGCGTG ACGGCCAGGA GATCCACCTG TCGGTGTACG ACATGCTGGT CAACGGCAAT
CTCGAACAGA ACCTGTTGCT CCAGGATGGC GACGTGCTGC ATGTGCCGGT GGTCGGCAAC
CAGCAGGTCT ACGTGATGGG CGAGGTCAAT ACGCCGACCG CGGTACCGAT GCCGAACGAG
CGTCTGTCGT TGACCAACGC CTTAGCCCAG GCCGGCGGCA TCAACGAGAA CAGCGCCGAT
GCCTCGGGGA TCTTCGTGAT TCGCCGCAAT CACGATGTCG AAAGCGACAC CTTCGCCACC
GTCTACCAGC TCAACGCCAA GAACGCGATC TCCTTCGTGC TGGGGTCGGA ATTCATTCTG
CAGCCCACCG ATGTGGTGTA TGTCACCGCC GCGCCGATTG CCCGCTGGAA CCGCGTCATC
AGCCAGATCC TGCCCAGCGT GACGGCGATC TACCAGCTGA CGCAGGCCAC GCGTGACATT
CAGGACATCG ACGATAACTA G
 
Protein sequence
MTLRRNESRS GWRKPLLGLS FMVALSGCAF APGGHIDYDT QGEDLSENIE VKPITPSLVK 
TMAASGDELR ESLFEYAENA EPQMEDLDYD YMIGRGDVLA VVVYDHPELT IPAGSERSAE
ESGNVVHSDG TIFYPYIGTV DVAGRTVRDV RSEIQRRLEG YIAQPQVDVK VAAFNAQKAY
VTGQVERAGA QPITNIPLTV LDALSNVGGL TQGGDWHDVV LTRDGQEIHL SVYDMLVNGN
LEQNLLLQDG DVLHVPVVGN QQVYVMGEVN TPTAVPMPNE RLSLTNALAQ AGGINENSAD
ASGIFVIRRN HDVESDTFAT VYQLNAKNAI SFVLGSEFIL QPTDVVYVTA APIARWNRVI
SQILPSVTAI YQLTQATRDI QDIDDN
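
The sequences above are displayed in space-separated groups of 10 characters, so they need cleaning before use in most tools. The following is a minimal sketch of one way to do that and to compute a GC percentage; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first and last display lines of the gene sequence are used here for illustration.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a display-formatted sequence block."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First and last display lines of the gene sequence in this record.
first_line = "ATGACTTTAC GCCGCAACGA GTCTCGTTCG GGATGGAGAA AGCCACTGCT GGGGCTTTCT"
last_line = "CAGGACATCG ACGATAACTA G"

first = clean_sequence(first_line)
last = clean_sequence(last_line)
print(len(first), first[:3])  # 60 ATG
print(len(last), last[-3:])   # 21 TAG
```

Passing the full cleaned gene sequence to `gc_percent` would give a value to compare against the GC content field listed in this record.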