Gene Csal_1721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1721 
Symbol 
ID4028829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1959793 
End bp1960938 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content66% 
IMG OID637966909 
Productglycosyl transferase, group 1 
Protein accessionYP_573772 
Protein GI92113844 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.394257 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGATA AAACGATCGT GCTGGTCTCC AACACGTCCT GGTTTCTTTA CAACTTCTGC 
CAGGGGTTGT TGAGGGCCCT GGAACGACGC GGCTTCCGGG TGGTGTGCCT GGTACCCCAT
GACGACTACT CGCAGCGCTT GTGCGAGGAG TTCAACGTCA CGCTGCGCAC CATGCCCATG
GACGGCAAGA GCACCGGCCC CGCTCGCGAG GGCAAGTGCC TGTTCTGGCT GTTCGGCCAA
CTGCGCGAGC TGCGCCCTGC CTTCGTCTTC AACTTCACCA TCAAGGCCAA CATCTATTCG
GGGCTGGCCT GCCGCGCGCT GAACCTTCCC TACGCCAACA CCGTGACCGG TCTGGGCACG
GCGTTTCTCC ACGACAGTCG GCTCTTCCGT CAGGTGCGAC GCCTGTACGG GGTGGCCAAC
GCCGGCGCGA CACGGGTGTT CTTCCTCAAC CCGGATGACC GCGAACTCTT CGAGCACGAG
GGCATGTTGC AGAAGGTCGA CTGGGACATG TTGCCCGGTG CGGGCGTCGA TGTGGCGCGC
TTCGGGTTCA AGCCCTTGCC CACCGGGGAG CCGTTCACCT TTCTGCTGAT CGCCCGGTTG
CTGGGCGACA AGGGCGTGCG CGAGTACGTG GCCGCGGCCG AGCAGGTACG CGCCACGCAT
CCCGAGACGC GCTTTCTGAT CGTCGGCCCC AAGGGCGTCA GCAACCGTAC CGCGATCGAG
GACGATGAGG TCGATGCCTG GCACGCGGGC GGGGTCGTCG AGTACGTCGG TGCCCAGGAC
GACGTGCGGC CCTGGCTCGC CCAGGCGCAT GTGCTGGTAC TGCCGTCCTA TCGCGAGGGC
ATGCCGAGCA CGGTGATGGA AGCGGCGGCG ATGGGGCGAC CGGCCATCGT GACCGATGTG
CCGGGATGCC GGCATGCCGT GACCGAGGGC GAGACCGGCT GGCTGTGCCG CGTCAAGGAC
GCCGAGGCCC TGGCCACGCG CATGCGGTAC TGCCTGGCGA TGACGCCCTG GGCGCTTTCC
CGCGTGGGGG TGGCGGCGCG CCGGCGTGCG GAGAAGGAGT TCTCGCAGGA CATCGTCGTG
ACGGGATACC TGGCCTGCCT GGAAAGCGGC CTGACGAGTC ACCAACGCGT GAAAATGGAG
GCATGA
 
Protein sequence
MNDKTIVLVS NTSWFLYNFC QGLLRALERR GFRVVCLVPH DDYSQRLCEE FNVTLRTMPM 
DGKSTGPARE GKCLFWLFGQ LRELRPAFVF NFTIKANIYS GLACRALNLP YANTVTGLGT
AFLHDSRLFR QVRRLYGVAN AGATRVFFLN PDDRELFEHE GMLQKVDWDM LPGAGVDVAR
FGFKPLPTGE PFTFLLIARL LGDKGVREYV AAAEQVRATH PETRFLIVGP KGVSNRTAIE
DDEVDAWHAG GVVEYVGAQD DVRPWLAQAH VLVLPSYREG MPSTVMEAAA MGRPAIVTDV
PGCRHAVTEG ETGWLCRVKD AEALATRMRY CLAMTPWALS RVGVAARRRA EKEFSQDIVV
TGYLACLESG LTSHQRVKME A