Gene Csal_1970 details


Gene Information

Locus tag: Csal_1970
Symbol:
ID: 4027210
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 2226196
End bp: 2227308
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 65%
IMG OID: 637967166
Product: mannosyl-glycoprotein endo-beta-N-acetylglucosamidase
Protein accession: YP_574021
Protein GI: 92114093
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1705] Muramidase (flagellum-specific)
TIGRFAM ID: [TIGR02541] flagellar rod assembly protein/muramidase FlgJ
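
The length fields in this record follow from the coordinates. The sketch below reproduces that arithmetic in Python. It assumes 1-based inclusive coordinates, which is consistent with the listed values but not stated by the record, and a complete CDS that ends in one stop codon. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length_bp(2226196, 2227308)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)  # 1113 370
```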


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGTCG ATGGACTCAG CAATCAGTTC GCCCTCGATG TGCAGTCGTT GTCGCGCCTC 
AAGCACACGG CGAGCCAGTC GCCGGAGAAG GGCTTGTCCC AGGCGGCGGA CCAATTCGAG
GCGATCTTTC TGCAGATGAT GCTCAAGAGC ATGCGCGACG CCATACCGCA GTCGGACCTG
CTGAGCAGCA ACGAGACCGA TACCTATACC TCGATGCTCG ACAAGCAGTG GGCGCAGAAG
ATGGCGGGGC ATGTCGGCCT CTCGGACATG CTGGTCGAGC AGCTTCAGGG GCGGGGCCTC
GTGGGGCGCG ATGAGGAGGT GACGCGCAAC GACCTGATCG CGGGGATTCC CCGCGGCACG
CCACGTGTCT TGAGCGATCC GATCGTGCCC CACGAGGCCG CCTCCAAGGA TTCAGGGCCC
GGGGATGACG CCGTGACCTC GGCTTCCGGC GCTTCGTCCT CGAGCGCACC GTCGGAGGTG
GCGACGAGTC GCGAGATGTC ACCGTCGAGT GCCGATATCG AGGACGCACG AGCGGCGCCG
CACGTCGAGG CGTTCCTGTC GCGGCTGCAT GAGCCCGCCG AAGCCGCCGC CCGCGAAAGC
GGTGTGCCGG CATCGTTGAT CCTGGCCCAG GCGGCGCTGG AAACCGGCTG GGGCGAGCGT
GAGATTCCCG CGCGCGATGG CGGCAACAGC CACAACCTCT TCGGTATCAA GGCGACCGGT
GGCTGGGATG GCGAGGCCAC CAGCATCACC ACCACCGAAT ATGTCGACGG TCGTGCCCGC
CAACAGGTCG ACGAGTTCCG TGTCTACGAT TCCTTCGAAG CCGCGTTCAA GGATTACGCC
GAGTTGATCG GCGGCAATCC ACGTTATGCC GGGGTGGTCA CGGCTTCGAC GCCGCAGAAC
GCCGCCCGAG CTCTGCAATC CGGCGGCTAT GCCACCGACC CGAACTATGC CGACAAGGTG
ATCGCCGTCA TGGCGCAGAT CGACGACCGT CTTGCCAGCG GGCCGACCCT GGCCAGCACC
GCCGAGGTCA GCGAGTCGCA AGGCGGCGCG CCGACGCGCA ACGGTTCGTC GGATCCCTAT
GATATATCGC GGATGCCCAC GGGAATTTTT TGA
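
The GC content reported for this gene can be checked against the listing above. A minimal Python sketch follows, assuming GC content means the fraction of G and C bases over all bases. The helper name `gc_fraction` is illustrative, and only the first row of the listing is used here to keep the example short.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring the whitespace used for grouping."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First row of the gene sequence; paste the full listing here to compare
# the result with the GC content given in the record.
first_row = "ATGAGCGTCG ATGGACTCAG CAATCAGTTC GCCCTCGATG TGCAGTCGTT GTCGCGCCTC"
print(f"{gc_fraction(first_row):.0%}")
```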
 
Protein sequence
MSVDGLSNQF ALDVQSLSRL KHTASQSPEK GLSQAADQFE AIFLQMMLKS MRDAIPQSDL 
LSSNETDTYT SMLDKQWAQK MAGHVGLSDM LVEQLQGRGL VGRDEEVTRN DLIAGIPRGT
PRVLSDPIVP HEAASKDSGP GDDAVTSASG ASSSSAPSEV ATSREMSPSS ADIEDARAAP
HVEAFLSRLH EPAEAAARES GVPASLILAQ AALETGWGER EIPARDGGNS HNLFGIKATG
GWDGEATSIT TTEYVDGRAR QQVDEFRVYD SFEAAFKDYA ELIGGNPRYA GVVTASTPQN
AARALQSGGY ATDPNYADKV IAVMAQIDDR LASGPTLAST AEVSESQGGA PTRNGSSDPY
DISRMPTGIF
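
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below shows that mapping in plain Python for the first and last fragments of the gene listing. It assumes the residue assignments of NCBI translation table 11, which match the standard table. Alternative start codons are not handled, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace between base groups."""
    seq = "".join(dna.split()).upper()
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


first_fragment = "ATGAGCGTCG ATGGACTCAG CAATCAGTTC"     # start of the gene
last_fragment = "GATATATCGC GGATGCCCAC GGGAATTTTT TGA"  # final row of the gene

print(translate(first_fragment))  # MSVDGLSNQF
print(translate(last_fragment))   # DISRMPTGIF*
```

The trailing `*` is the TGA stop codon, which is why the 1113 bp gene yields 370 residues rather than 371.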