Gene Csal_0094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0094 
Symbol 
ID4026016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp118198 
End bp120234 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content63% 
IMG OID637965245 
ProductTonB-dependent receptor 
Protein accessionYP_572157 
Protein GI92112229 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0917922 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCATC GCTACCTTCC TGCTTCCCTG CTGCTTGGCG CCTTGAGCGC TTCCGGCGCC 
GCGTTCTCCC AGACGATCTC GCCGGAGGTG ATGGAGGTGA CGGCGCCGCG CCTCGATCGC
GAGCTCTATG CCACACCCGC CGCCGTTTCC GTGGTCGACC GCGACAGCAT TGCGCAAGGC
CAGCAGCGTG TTCGGCTCGA CGAGTCGCTG GTGACCGTGC CCGGGGTGTT CCTGCAAAAC
CGTGACAACT TCGCCCAGGG CGAGCGCCTG GCGATTCGTG GCTTCGGCGC ACGGGCACCG
TTCGGTGTAC GTGGAGTCAC GGTGATGGCC GACGGGATTC CCTACACGCT GCCCGATGGG
CAGGCGCAGC TGGATGCCAT CGACCTCGAC AGCGCCCAGC GGATCGAGGT GATCCGCGGA
CCGTCGTCGG TGCTCTACGG CAATGCGGCC GGTGGCGTGC TGAGCGTGAC CACGGCCGAT
GGCCGTGACG ACCAGAAGAC TCGCCTGGGG GCCGAGATCG GCAGCGACGG CTACCGGAAA
TATCGCTTCA GCGATGGCGG CGTGAACGGC CCCTGGTCCC ATCATGTGAG CGTCTCGGCG
CTGAATTTCG ACGGGTACCG GGATCAGAGC CAGGTGGAGA AATACCATCT GAACGCCAAG
GTGCGCCGTG AGCTGGGCAA TGATCGGGCA CTGACGGCCA TCGTCAACTT GCTGGACAAT
CCGCGCTCCG AGGACCCGGG CGGACTGACG CGCGAGCAGG TCGATGAAGA CCGTAACCAG
GCCGGCGACT TCACCGAGGA ATACGACACG GGCCAGAACG TCGACCAGCA GGTGCTGGGG
CTGCAGTACG AGGATCTGTC CGCCGGGCCG GGCGAGCTGT ACGTCAAAGG CTTCTATCTA
CAGCGTGACT TCGAACAGCA ACTGCCCTAT CCCGGCGACA GTCTCCTCGG CTACGAGCGT
GACTACTTCG GGGGCAGTGC CGAGTATCAC CAGGATCTGC TGCTGGGCGA GCTGCCGCTG
CGATATGTGG TCGGTGTCGA TGTGGCGCGT CAAGAGGACG ATCGCTGGCG GCGTAACGTC
GAGTTCGATG GCACGGTCGG CGGTGACACC GCCGACGAGA CCCAGACGGC CACTTCGCTG
GGTATTTTCG CCCAGGGCGA TCTGGATCTC ACCGACAAGT TGACGCTATC GCTGGGGACC
CGCTACGACC GCGTCGACTT CGACATCGAC GATGATTTCG GCAGCGACGG CGACCAGAGC
GGCGACCGTA CCTTCCGCGA ATGGAGCGGC TCGGCGGGCT TGAGCTATCG GTACTTGCCG
ACGCATCAGG CTTATGTCAA TACCGGCACG TCTTTTGAAA CTCCCACGTT TTCTGAATTC
GCCAACCCCA GTGGCGTGGG CGGCTTCAAT CCTGCCGTCG AGCCACAGAA GGCCTGGAAT
CGCGAAATCG GGCTGCGGGG GAATTTCGAC AATGGCGTGG ATTACGATCT GGCGCTGTTC
TCGGTGCGTG TGCGCGACGA GCTGGTGCCT TACAACGAGA ATGGGCGGGA CTTTTACCGC
AATGCCGGCG ATTCCTCGCG GGATGGTATC GAGCTGGCGC TGGGCTGGCA GATGACGCCG
AGCTGGCGTC TCGACAGTGC CTTGACGCTG GCCAGGTACG AATTCGATGA ATACGACACC
CAGGATGGCA ACTACGGGGG CAACCGCATC CCCGGCCTGC CGGAGCAGAC CTGGATGAAC
CGGCTGACCT GGAAGGGCTT CGACGAGCGC TTCGCGACGC TCGAGACGCA GTACATCGGC
GACATGGTGG CGGACGACGC CAACGATGTG GCGGTCGACG ATTACTGGCT GGTCCACCTG
CGCGCCGGCG ATGGCTGGCA CCTGGGTGGC GATACCTTGC TCAAGGGCTA CGTGGGGGTG
CGTAACCTCT TCGATCGCGA GCATTTCGCC AATGTGCGGA TCAATGCCAA TAACGACCGC
TATTTCGAAC CGGCATCGGG ACGGACCGTC TACGCTGGTA TGGAAGTCGC GTTCTAG
 
Protein sequence
MTHRYLPASL LLGALSASGA AFSQTISPEV MEVTAPRLDR ELYATPAAVS VVDRDSIAQG 
QQRVRLDESL VTVPGVFLQN RDNFAQGERL AIRGFGARAP FGVRGVTVMA DGIPYTLPDG
QAQLDAIDLD SAQRIEVIRG PSSVLYGNAA GGVLSVTTAD GRDDQKTRLG AEIGSDGYRK
YRFSDGGVNG PWSHHVSVSA LNFDGYRDQS QVEKYHLNAK VRRELGNDRA LTAIVNLLDN
PRSEDPGGLT REQVDEDRNQ AGDFTEEYDT GQNVDQQVLG LQYEDLSAGP GELYVKGFYL
QRDFEQQLPY PGDSLLGYER DYFGGSAEYH QDLLLGELPL RYVVGVDVAR QEDDRWRRNV
EFDGTVGGDT ADETQTATSL GIFAQGDLDL TDKLTLSLGT RYDRVDFDID DDFGSDGDQS
GDRTFREWSG SAGLSYRYLP THQAYVNTGT SFETPTFSEF ANPSGVGGFN PAVEPQKAWN
REIGLRGNFD NGVDYDLALF SVRVRDELVP YNENGRDFYR NAGDSSRDGI ELALGWQMTP
SWRLDSALTL ARYEFDEYDT QDGNYGGNRI PGLPEQTWMN RLTWKGFDER FATLETQYIG
DMVADDANDV AVDDYWLVHL RAGDGWHLGG DTLLKGYVGV RNLFDREHFA NVRINANNDR
YFEPASGRTV YAGMEVAF