Gene Csal_0111 details


Gene Information

Locus tag: Csal_0111
Symbol:
ID: 4027037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 133431
End bp: 134693
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 63%
IMG OID: 637965262
Product: nucleoside transporter
Protein accession: YP_572174
Protein GI: 92112246
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1972] Nucleoside permease
TIGRFAM ID: [TIGR00804] nucleoside transporter
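
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end coordinates are inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 133431
end_bp = 134693
gene_length_bp = 1263
protein_length_aa = 420

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # True
print(gene_length_bp % 3 == 0)                   # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under those assumptions, the three fields (coordinates, gene length, protein length) agree with one another.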


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCACTA TCATGAGCCT GGTAGGGATG GCGACGCTGA TCCTTATCGC GGTGCTGTTT 
TCCTCCGATC GCAAGTCGAT CCGGTTGCGT ACGGTTGGCG GTGCCTTTGC CATCCAGGCC
GCCATCGGCG CCTTCGTGCT TTACATTCCC TTCGGACAAG CCGTCCTGGC AACGCTCTCG
GAGGGAGTCA GTCAGGTCAT CGTCTATGCC GACGATGGCA TCAACTTCTT GTTCGGCGAC
CTGGCCAACC CCGAGAGCGT CGGATTCGTC TTCGCCTTCA AGGTATTGCC GATCATCATC
TTCTTCTCGT CGCTGATCGC CGTGCTCTAC CACCTGAAGA TCATGCAGTG GATCATCCGC
CTGCTGGGGG GCGCACTTCA GCGCGTGCTG GGCACCTCTC GCACCGAATC GATGTCGGCC
ACGGCGAATA TCTTCGTCGG TCAGACCGAA GCCCCCCTGG TCGTGCGCCC CTTCATTGCC
TCCATGACAC GCTCCGAGCT GTTCGCCGTC ATGTGCGGGG GCCTGGCGTC GGTGGCGGGC
TCGGTACTGG CGGGATACGC GAGCCTGGGC ATTCCCATGG AGTACCTGAT CGCGGCTTCC
TTCATGGCGG CGCCGGGCGG TTTGCTGTTC GCCAAGCTGC TGATGCCGGA GACCGAAAAG
CCCGATGACA GCATCTCGCG AGCCGAAGAG AAGATCGAGG AAGACGAAAA ACCCGCCAAC
GTGCTCGACG CCGCGGCCAC CGGGGCCACA TCCGGCATGA TGCTGGCAGC CAACGTGGGG
GCCATGCTGC TGGCGTTCAT CGGTCTGATC GCATTGCTCA ACGGCATTCT CGGCGGCGTG
GGCGGCTGGT TTGGCATGGA GTCGCTGAGT CTGGAAATGA TTCTCGGCTG GCTCTTCGCG
CCGCTGGCCT TCCTGCTGGG CATTCCCTGG AGCGAAGCCA CGCTCGCCGG CTCGTTCATC
GGTCAGAAAA TCGTCGTCAA CGAGTTCGTC GCCTTCATCA ACCTGGCGCC GTACATCTCC
GGCGATACCG TCGTGGCCGC CACCGGCGAG GCCATGTCCA AGCACACCGC CGCCGTGCTG
TCCTTCGCGC TTTGCGGCTT CGCCAACCTG TCCTCGATCG CCATTCTCCT CGGCGGGCTC
GGTCTCATGG CGCCCAACCG CCGCCAGGAA ATCGCCCGCT ACGGCCTCAA GGCCGTGCTC
GCCGGTACGC TTTCCAACCT GATGTCGGCC ACCATCGCCG GGCTCTTCAT CAGCTTGGCA
TGA
 
Protein sequence
MTTIMSLVGM ATLILIAVLF SSDRKSIRLR TVGGAFAIQA AIGAFVLYIP FGQAVLATLS 
EGVSQVIVYA DDGINFLFGD LANPESVGFV FAFKVLPIII FFSSLIAVLY HLKIMQWIIR
LLGGALQRVL GTSRTESMSA TANIFVGQTE APLVVRPFIA SMTRSELFAV MCGGLASVAG
SVLAGYASLG IPMEYLIAAS FMAAPGGLLF AKLLMPETEK PDDSISRAEE KIEEDEKPAN
VLDAAATGAT SGMMLAANVG AMLLAFIGLI ALLNGILGGV GGWFGMESLS LEMILGWLFA
PLAFLLGIPW SEATLAGSFI GQKIVVNEFV AFINLAPYIS GDTVVAATGE AMSKHTAAVL
SFALCGFANL SSIAILLGGL GLMAPNRRQE IARYGLKAVL AGTLSNLMSA TIAGLFISLA
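
The relationship between the gene sequence and the protein sequence can be sketched in code. This is an illustrative example under stated assumptions, not the tool that produced the record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (table 11 differs in the set of permitted start codons, which does not matter here because the gene starts with ATG). Only the first 30 bases of the gene are used as input.

```python
# Codons enumerated in NCBI order, with bases ranked T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence, as displayed in the record.
first_blocks = "ATGACCACTA TCATGAGCCT GGTAGGGATG"
print(translate(first_blocks))  # MTTIMSLVGM
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.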