Gene Csal_1852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1852 
Symbol 
ID4028372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2106592 
End bp2107692 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content65% 
IMG OID637967046 
ProductTonB-like protein 
Protein accessionYP_573903 
Protein GI92113975 
COG category[R] General function prediction only 
COG ID[COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID[TIGR01352] TonB family C-terminal domain
[TIGR02794] TolA protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAAAC GTGAACGCCG TGTAGGCTAT GGGTGGCCCA CGCTACTGGC GATTGGCGTG 
CATGTCGCGG CGCTGTTGTT GACCATCATC AAATGGCCTG CCAGCGAGCC CGAACCCACG
TCGTCATCGG TGGTACAAGC CACCTTGGTG CGCGCCGAAA CGGCGACGGA TCAGCCTCAG
CAAACCGAGG AATCCGCGCC TGCCGCCTCG CAGGCCGAGG AGCAAACGCC GGACGAGCCG
CCCGCGCCCG ACACGAACGA AGAAGCCCCC AGCGAACCGG ACGAGACCAG CGTCCCGGAG
GTATCGGACA CCGCGGTGGA ACAAGCCCGA GCAGAGGCAC AGCGCTTGCA GGAAGAGGCC
GAACGCGCCG CGCAAGCCCG CGCGGAAGCC GAGGCGGAAC GCCAGCGCCA GCAGGAAGAA
GCCGAACGCC AGGCCGAGGC GGAGCGTCAA CGCCAGCAGG AAGAGGCCGA GCGCCAGGCC
GAGGCGGAGC GCAAGCGTCA GCAGGAAGAA GCCGAACGCC AGGCCGAAGC GGAGCGCAAG
CGTCAGCAGG AAGAAGCCGA ACGCCAGGCC GAAGCGGAGC GCAAGCGCCA GCAGGAAGAA
GCCGAACGCC AGGCCGAAGC GGAGCGCAAG CGCCAGCAGG AAGAGGCCGA ACGCCAGGCC
GAAGCGGAGC GCAAGCGCCA GCAGGAAGAG GCCGAGGCGG CGCGGCAGAG AGAACTTGCC
GAACGGGCCG CCCAGGCCAA TGCCTCTTCT CTGGAAAGCA TGATCAGCGA GGAACAGCAA
TCCATCGCCA ATGCCGAGCA GGCCCGAGAG GCTGCCAATG GCTTCAAGAA CCTGGTGCGT
CGCTACGTCG AGCAAAGCTG GAACCTGCCA CCCAGTGCAT CGCGACAATT GCGGGCCATC
GTGCGCATTC AGCTATTGCC CACGGGCGAA CTGGTGGGAG CCACCATCAC GCAAAGCAGT
GGCGATGCGG CCTTCGACCG CTCGGTGATC AACGCCGTCG AAAGAGCTGC ACCGTTTCGT
GAAATGAGTG AACTCGACGC CTCGGTACAA CGTCAGTTCC GTGAATTCAA TCTGGACTTC
AACCCAGAGG ACATCCGCTG A
 
Protein sequence
MAKRERRVGY GWPTLLAIGV HVAALLLTII KWPASEPEPT SSSVVQATLV RAETATDQPQ 
QTEESAPAAS QAEEQTPDEP PAPDTNEEAP SEPDETSVPE VSDTAVEQAR AEAQRLQEEA
ERAAQARAEA EAERQRQQEE AERQAEAERQ RQQEEAERQA EAERKRQQEE AERQAEAERK
RQQEEAERQA EAERKRQQEE AERQAEAERK RQQEEAERQA EAERKRQQEE AEAARQRELA
ERAAQANASS LESMISEEQQ SIANAEQARE AANGFKNLVR RYVEQSWNLP PSASRQLRAI
VRIQLLPTGE LVGATITQSS GDAAFDRSVI NAVERAAPFR EMSELDASVQ RQFREFNLDF
NPEDIR