Gene Csal_2174 details


Gene Information

Locus tag: Csal_2174
Symbol:
ID: 4026668
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 2447058
End bp: 2448797
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 65%
IMG OID: 637967379
Product: type II secretion system protein E
Protein accession: YP_574224
Protein GI: 92114296
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID: [TIGR02538] type IV-A pilus assembly ATPase PilB
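
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so, but the reported length matches that convention) and that the final codon of the CDS is a stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2447058, 2448797)
print(length, protein_length_aa(length))  # prints "1740 579"
```

Both values agree with the Gene Length and Protein Length fields of this record.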


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCATTGT CTTCATCATC CACCTCCGAT ACGGAGGCAT CACCGCTGAA AGGGCTCGCC 
GCGCGCCTGG TGCAGGAAGG TCTGCTCGAG GAACAGCAGG CGCGCAGCGC CCTGGAGGCC
GTACGCGAAA GCCACCAGTC GATGCTCGAG TATGTCATCG AGCGCCAGTG GGTCCCTGCC
CGCGCGGCGA CCCAGGTTGC GGCCTGGGAG TACGGTCTGC CGTTCGTCGA TCTCGAGGCC
GTCGACCTCG GCCGGCTCCC CACCGGTGAG GGCCTGCCCG ATGCGATGAT ACGTCGTCTG
GGCATACTGC CCCTGGCGCG CAGCGAGCGG CATATCAGCG TGGCGGTTCC CTACCCCGCG
ACCCTGGCGA AACTGGACGA ACTGCAGTTC AGTACCGGCC TTTCCGCCGA GGGGGTCCTG
GTCGCCGCCG ATCAGTTGCA ACCCGCCATC GACAGCCACC TTGCCCAGCG CGAGACGCAC
GGCATGCTCG AGGCCCTTGG CGATGCCAGC GAAGCGCTCG AGACGCTGGA TCTCGACGAT
GTCGAGCCTC TCGAGGATGG GCTCGGTACC GAGGACGACG ATGCGCCGGT GGTGCGCTTC
GTGCACAAGA TGATGATCGA TGCGGCCCGT CGCGGCGCCT CGGACATTCA CTTCGAGCCG
TACGAGTCGA CCTTTCGCAT TCGTTTTCGC ATCGATGGCA TCCTGGTCGA GGTCGCACGC
CCACCATTCA ACCTGCGTGC GCGTCTGTCG GCACGCCTCA AGGTCATGTC GCGCCTGGAC
ATCTCCGAAC GCCGCCTGCC TCAGGACGGC GCGATCAAGC TGCGGCTGTC GGCATCGCGA
ACCATCGATT TCCGCGTCAG CACCTTGCCC ACGGTCCATG GCGAGAAGAT CGTGCTGCGT
CTGCTGGATG CCGGCGCCAC GCGCCTGGGG ATCGATGCAC TGGGGTTCAG CGAGGCGCAG
CGCGCGATGT TCGAGCACGC CCTGGCACAG CCCCAGGGCA TGATTCTCGC CACCGGCCCC
ACCGGCAGCG GCAAGACCGT GACCTTGTAC ACTGGGCTCA ATATCCTCAA CACCGACGCG
CGCAACATTT CCACCGCCGA GGATCCCGTC GAACTCAAGC TCGACGGCAT CAACCAGGTC
AGCGTTCAGC CGAAGATCGG GCTGGATTTC GCCAATGCCT TGCGCGCCTT CCTGCGCCAG
GACCCCGACG TGGTCATGGT CGGCGAGGTG CGCGACCTGG AGACCGCCGA AATCGCCGTC
AAGGCCGCGC AAACCGGACA CCTGGTGCTC TCGACGCTGC ATACGAATTC GGCGGCGGAA
ACGCTGACGC GCCTGCGCAA CATGGGGGTG GCGGCCTACA ACATCGCCAG TTCGGTGAGC
CTGATCGTCG CCCAGCGGCT CGTGCGACAA CTCTGCCCGC ACTGCAAGAC GCCCGTCGAG
CTGCCCCGGG AGCTCTTCAT CGAGGCCGGC TTCACGTCCA CCGACATCGA ATCGACGACC
TGCTATCGTG CCGCGGGCTG CGCCCAGTGT ACCCACGGGT ACAAGGGACG CGTGGGAATC
TACGAGGTAC TGCCCGTCAG CGAGGCCATG AGCAAGTTGA TCATGGCCGA CGGCAATTCT
CTGGAACTGG GGGAACTGGC CCGTCAAGAA GGCCACCCCG ATTTAAGACG CAGCGGACTG
AGCAAGGTGC TCGCCGGCAT CACCAGCCTC GAGGAAATCA ACCGCGTGGT GCTGGAATGA
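
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any analysis. A minimal sketch of that clean-up, together with a GC-fraction helper, is shown below. The helper names are hypothetical, and only the first displayed line is used as input here, so the result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence only (illustrative input).
first_line = "ATGCCATTGT CTTCATCATC CACCTCCGAT ACGGAGGCAT CACCGCTGAA AGGGCTCGCC"
seq = clean_sequence(first_line)
print(len(seq))  # prints "60"
print(f"{gc_fraction(seq):.2f}")
```

Passing all of the displayed lines to `clean_sequence` and then to `gc_fraction` would give a value to compare against the GC content field.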
 
Protein sequence
MPLSSSSTSD TEASPLKGLA ARLVQEGLLE EQQARSALEA VRESHQSMLE YVIERQWVPA 
RAATQVAAWE YGLPFVDLEA VDLGRLPTGE GLPDAMIRRL GILPLARSER HISVAVPYPA
TLAKLDELQF STGLSAEGVL VAADQLQPAI DSHLAQRETH GMLEALGDAS EALETLDLDD
VEPLEDGLGT EDDDAPVVRF VHKMMIDAAR RGASDIHFEP YESTFRIRFR IDGILVEVAR
PPFNLRARLS ARLKVMSRLD ISERRLPQDG AIKLRLSASR TIDFRVSTLP TVHGEKIVLR
LLDAGATRLG IDALGFSEAQ RAMFEHALAQ PQGMILATGP TGSGKTVTLY TGLNILNTDA
RNISTAEDPV ELKLDGINQV SVQPKIGLDF ANALRAFLRQ DPDVVMVGEV RDLETAEIAV
KAAQTGHLVL STLHTNSAAE TLTRLRNMGV AAYNIASSVS LIVAQRLVRQ LCPHCKTPVE
LPRELFIEAG FTSTDIESTT CYRAAGCAQC THGYKGRVGI YEVLPVSEAM SKLIMADGNS
LELGELARQE GHPDLRRSGL SKVLAGITSL EEINRVVLE
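
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the conventional TCAG ordering. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts; since this gene begins with ATG, that difference is ignored here. The `translate` function is an illustrative helper, not a library call.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated as TTT, TTC, TTA, TTG, TCT, ... ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop display spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three displayed blocks of the gene sequence.
print(translate("ATGCCATTGT CTTCATCATC CACCTCCGAT"))  # prints "MPLSSSSTSD"
```

The output matches the first block of the protein sequence. Likewise, the last four codons of the gene, `GTGCTGGAATGA`, translate to `VLE` followed by a stop, matching the end of the protein.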