Gene Csal_0089 details


Gene Information

Locus tag: Csal_0089
Symbol:
ID: 4026011
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 112276
End bp: 113766
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 64%
IMG OID: 637965240
Product: Na+/solute symporter
Protein accession: YP_572152
Protein GI: 92112224
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family
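
The coordinate and length fields of this record are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based, inclusive coordinates and a coding sequence whose final codon is an untranslated stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(112276, 113766)
    print(length, protein_length_aa(length))
```

With the values from this record, the two functions reproduce the listed gene length (1491 bp) and protein length (496 aa).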


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000762337
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAGTC ACGTATTCCT CGGTGCCTTC GTCACCTACG TGGTGGCCAT GATCGCGTTC 
GGGTGGTGGG TCTCCCGCCA TAGCCGAAGC AATGGCGATG ACTTCCTGCT CGGTGGGCGC
AGCATCCCGA TCTTCCTGAC GATCGGGACC ACCGTCGCCA CCATGGTGGG TACCGGCTCG
AGCATGGGCG CGGTCGGCTT CGGCTATGCC AACGGCTGGG CCGGGGCCCT CTACGGCATC
GGCGGCTCCA TCGGCGTGCT GCTGCTGGCG GCGTGGTTCG CCCCCGTACG CAAGCTGCGC
TTCATGACCA TGAGCGAGGA GCTTTCCTAT TATGTCGGCG CCAACCGCTG GGTGCGCAAC
ATCGTCGCGG TGCTGATCTA CATCGCCTGT ATCGGCTGGC TCGGCGCGCA CATTCTCGGT
GGCGGGCTCT ATCTATCCTG GATGGCCGAC ATCGACCTCA CCACCGCCCG CGTTCTGGTG
GCGCTGGGCT TCGGCATCTA CTGCGTGATC GGCGGTTACA TGGCCGTGGT GTGGACCGAT
ACCGTCCAGG CGGTGATATT GTTCGTCGGC TTCATCGTGA TGGCCATCAT CGCATTGTTC
GAAGTGGGTG GATTCTCGGG ACTGGGCGCC AACATGGACG TCGCCACCAC CAGTTTCCTG
GGCATCGACA AGATCGGCGT GGTGCCGGCA CTGTCGCTGG CGGCGGTCAT CGCCGTCGGC
GTGCTGGCAA CCCCCTCCTA CCGTCAGCGC ATCTACTCCG GCAACTCGGT CCATTCCGTA
CGCAAGTCAT TCGTGATCAC CGGCATTCTT TACATGATCT TCTCGATCAT CCCGGCGATC
ATCGGCATGG CCACCCACGC GCTCAACCCC GGTCTGGACA ACAGCAACTT CGCCTTTCCG
TTTCTGGCGA CCGAGATCCT GCCGCTGTGG CTGGGTTTGT TGCTGCTCGT CGCGGGCCTC
TCGGCGACCA TGTCGAGCGC CAGCTCGGAC GCCATCGCCG GCGTCTCGAT CCTGTTGCGC
GACGTCTACG TGCTGTTCAC CGGGCATACG CCTTCGGCAC ACAAGGTCGT CTTGCTCTCG
CGTCTCGCGC TGATCGCCAC CATCGGCCTG GCGCTGCTGT TCGCATTGAC CTCCGACAAC
GTCATCGACT ACATCACCAG CATGATCGCC ACGGTCATGG CCGGCATGTT CGTGTGTGGC
GTGCTGGGTC GCTTCTGGCC GCGCTACAAC TGGCAGGGTG CCGTCGCCAC GCTCATCGCG
GCCTCCGCCA CCTCGCTCGC CGTCATCGGC GTCGATAGCT GGAGTGCCTT CTGGGGCAAC
CCGAGCATTC CCGCCGTGCT GGTCGCACTC CTCGTGGGTG TCGTGGTCAG CCTGATCACC
CCCGCTTCGA AGGTGACACC GGAGCAGGCG CTCAAGATCA TCGACGACGA GCGTGAGCGC
ATGGAGATGA CCGAGGACCC CGCCGAGGCC ATTCATCACA AGACGGCATA A
 
Protein sequence
MNSHVFLGAF VTYVVAMIAF GWWVSRHSRS NGDDFLLGGR SIPIFLTIGT TVATMVGTGS 
SMGAVGFGYA NGWAGALYGI GGSIGVLLLA AWFAPVRKLR FMTMSEELSY YVGANRWVRN
IVAVLIYIAC IGWLGAHILG GGLYLSWMAD IDLTTARVLV ALGFGIYCVI GGYMAVVWTD
TVQAVILFVG FIVMAIIALF EVGGFSGLGA NMDVATTSFL GIDKIGVVPA LSLAAVIAVG
VLATPSYRQR IYSGNSVHSV RKSFVITGIL YMIFSIIPAI IGMATHALNP GLDNSNFAFP
FLATEILPLW LGLLLLVAGL SATMSSASSD AIAGVSILLR DVYVLFTGHT PSAHKVVLLS
RLALIATIGL ALLFALTSDN VIDYITSMIA TVMAGMFVCG VLGRFWPRYN WQGAVATLIA
ASATSLAVIG VDSWSAFWGN PSIPAVLVAL LVGVVVSLIT PASKVTPEQA LKIIDDERER
MEMTEDPAEA IHHKTA
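
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below is one way to do that and to compute a GC fraction for comparison with the GC content field of this record; the helper names are hypothetical, and it assumes the input contains only sequence letters and whitespace.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence into one uppercase string, dropping whitespace."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a small example.
    sample = clean_sequence("ATGAACAGTC ACGTATTCCT\n")
    print(sample, len(sample), gc_fraction(sample))
```

Passing the full gene sequence through `clean_sequence` should give a string whose length matches the Gene Length field; the same cleaning step applies to the protein sequence.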