Gene Csal_1044 details

Gene Information

Locus tag: Csal_1044
Symbol:
ID: 4027832
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 1174050
End bp: 1176215
Gene Length: 2166 bp
Protein Length: 721 aa
Translation table: 11
GC content: 60%
IMG OID: 637966221
Product: TonB-dependent receptor
Protein accession: YP_573100
Protein GI: 92113172
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01783] TonB-dependent siderophore receptor
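
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1174050
end_bp = 1176215
gene_length_bp = 2166
protein_length_aa = 721

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part
# of the protein, so the protein has one residue fewer than codons.
codons, remainder = divmod(span_bp, 3)
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)                     # coordinates match the gene length
print(remainder == 0)                                # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # protein length matches
```

Under these assumptions all three checks print True for this record.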


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGACT TACAGCGATC ACGTCTCAAG CAGCTATCGC TGCTCCCGTG GGTATTCGTC 
TCGGCTCTAC CCGCCACGGC ACTGGCCCAG GACTCAACGA GCACGGACTC CCAAGAGAAT
CACGGGCTGG ACCCATTGGT CATCACTGCC ACCCGCAACC AGAGTCGCGA AGATGAGACC
CCGCAGAAGG TCACCATCAT CACCCGCGAG CAGATCGAGC AGCAGTTGGC GATCACCCAG
GATCCCAGCC AGGTGCTCAG TAACTTGATC CCTTCCTATT CGCCTAGCCG CCAGAAGCTC
AACAATACCG GCGAGACCTT CCGCGGTCGT TCGGCCCTTT TCATGATCGA CGGCGTTCCT
CAGTCCAATC CGCTGCGCGA CGTGGGAAGA GATAGTTACA CCATCGATCT CTCCATGGTC
GAGCGTATCG AAGTCATATA CGGCGCCAGC GCCGAGCATG GCCTGGGTGC GACCGGCGGT
ATCATCAACT ATGTGACCAG GCGACCTGAA GGCGCCGGGG TAAGCCAGCA TGTCGGCGTC
AGCTTGACCA GTGACGACGA TTTCGAGTCG GAAGGTTTCG GCCACAAGCT GGACTACCGT
CTCAGCGGCC AGACCCGCGA TTGGGACTAC ATGGTCGCCG CTAGCCGCCA AAAACGCGGT
GTCTTCTACG ATGGCAACGA CGAGATCGTC GGCATGGCCT ACCATGGCGA GATCCAGAAC
TCCGAAAGCT ACGACCTGAT GGCCAAGGCG GCCTATTGGA TCGACGACGA TCAGAACGTC
GAATTCTCCC TCAATCATTA CGACCTCGAC GTCGGCGACG ATTACGTGCC GGTGTCCGGT
GATCGCGATA AAGGCGTAGC GACTACGGCC GAAAAAGGCA ACCCCGCTAA CGACCCAGGC
TATAATAGGG TCACCACGGC TAGGCTGGCC TACTCGAACA AAGACTGGCT AGGCAACGAG
CTGGATGCCC AACTCTACAC CCAGCGTTTC CGCGCCCAGT TTGGTGCCAC TGGCCTCGGC
TCCTTCCCTT ACCAGGACGA CGCGGGCAAT ACCCGGTACG ACCACACACG CAACGAGTCC
GACAAGGTCG GGGCCAAGTT CACTCTAAGC CGCGACGGAC TGCTCGACGA CCGCCTCAAG
CTGACCACGG GTCTGGACCT GCTGCAGGAC GAGACCCAAC AGGTGCTGGT CAAGACCGAC
CGCAGCTATG TACCCGAGAG CCAGTTCCGC AACTATGCGG TCTTCCTCCA AGGGGACTAC
GACCTGACCC AAGCTCTGAG CCTGCATGCC GGGGCTCGCC AGGAGCATGC TACGCTAAAC
GTCGACGATT ACTCGACGGT AGACCGTAGT ACCAGTGTCG AAAACGACCT GGTCTCGGTG
GGTGGCGGCA ACCCCAGTTT CGACGAAACC TTGTTCAATG CCGGTATCGT CTATCAAGCC
ACCGACTGGG CCCAGCTCTA CGCCAACTAC TCCGAAGGTT TCGGCATGCC CGATGTCGGC
CGAGTGCTGC GCAGTGTCAG CGAGCCCGGC CAGGACGTCG ACACCCGAGT CGACCTCTCA
CCCATCGTCA CCGACAATCG CGAGATCGGG GCGCGCTTCG ACTGGGACCG CTACGGGCTC
GAGCTGAGTT ATTATGAGTC GAACTCCGAC CTTGGCCAAC GCATCGAACC GGATGCTCAG
AATAACTACC GGGTCAAGCG AGAGAAGACC GAGATCCAGG GCTATGAGAT CACCGGCGAG
GCGCAGGTGA GTGATGCTCA TCAATTGCGG CTTTCCTATA CGCACACCGA GGGCAAGTCG
GACACTGACG GTGATGGCAG CGTGGATACC AAACTGACGG GCCGAGACCT TGCCCCGGAC
ACCCTCAAGC TCGCCTGGAG CGCGGCCTGG AACGAAAAGC TATCCTCTTA CCTGCAGTAC
AGCTACTACC GTGATCGCAG CTTCGACGAT CCCGAGCTCG AGTTCGACGG CTACGGCCTG
ATCGATGCCT CCCTGGCCTA TCGCCTTCCC ATAGGCCGTG CCAGCCTGGG CGTGGAAAAC
CTCACCGACA AGGACTACTT CACCTACTAC TCACAATCGT TCCCCCAAGG GGAGGCTCTT
GAGGACGACC TATATTTCAA GGGCCGGGGA CGCACTTTCA CCCTGAGCTA TCAGCTCGAC
TTCTGA
 
Protein sequence
MQDLQRSRLK QLSLLPWVFV SALPATALAQ DSTSTDSQEN HGLDPLVITA TRNQSREDET 
PQKVTIITRE QIEQQLAITQ DPSQVLSNLI PSYSPSRQKL NNTGETFRGR SALFMIDGVP
QSNPLRDVGR DSYTIDLSMV ERIEVIYGAS AEHGLGATGG IINYVTRRPE GAGVSQHVGV
SLTSDDDFES EGFGHKLDYR LSGQTRDWDY MVAASRQKRG VFYDGNDEIV GMAYHGEIQN
SESYDLMAKA AYWIDDDQNV EFSLNHYDLD VGDDYVPVSG DRDKGVATTA EKGNPANDPG
YNRVTTARLA YSNKDWLGNE LDAQLYTQRF RAQFGATGLG SFPYQDDAGN TRYDHTRNES
DKVGAKFTLS RDGLLDDRLK LTTGLDLLQD ETQQVLVKTD RSYVPESQFR NYAVFLQGDY
DLTQALSLHA GARQEHATLN VDDYSTVDRS TSVENDLVSV GGGNPSFDET LFNAGIVYQA
TDWAQLYANY SEGFGMPDVG RVLRSVSEPG QDVDTRVDLS PIVTDNREIG ARFDWDRYGL
ELSYYESNSD LGQRIEPDAQ NNYRVKREKT EIQGYEITGE AQVSDAHQLR LSYTHTEGKS
DTDGDGSVDT KLTGRDLAPD TLKLAWSAAW NEKLSSYLQY SYYRDRSFDD PELEFDGYGL
IDASLAYRLP IGRASLGVEN LTDKDYFTYY SQSFPQGEAL EDDLYFKGRG RTFTLSYQLD
F
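
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below, which is not part of the original record, shows one way to strip that formatting, translate the nucleotide sequence, and compute a GC fraction. It uses only the first line of each sequence as a short excerpt. The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers, and the codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons.

```python
# Build a codon table from the compact NCBI-style layout
# (bases ordered T, C, A, G at each codon position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()

def translate(dna: str) -> str:
    """Translate complete codons until the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)

# First line of the gene sequence and of the protein sequence, as printed.
gene_excerpt = clean("ATGCAAGACT TACAGCGATC ACGTCTCAAG CAGCTATCGC TGCTCCCGTG GGTATTCGTC")
protein_excerpt = clean("MQDLQRSRLK QLSLLPWVFV")

translated = translate(gene_excerpt)
print(translated == protein_excerpt)  # excerpt translation agrees with the record
print(round(100 * gc_fraction(gene_excerpt), 1))  # GC percentage of the excerpt only
```

The GC value printed here covers only the 60-base excerpt, so it is not expected to equal the 60% reported for the whole gene. Pasting the full gene sequence into `clean` would allow the same check against the complete protein sequence.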