Gene Csal_1503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1503 
Symbol 
ID4028406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1704939 
End bp1706183 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content65% 
IMG OID637966686 
Productnucleoside transporter 
Protein accessionYP_573555 
Protein GI92113627 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1972] Nucleoside permease 
TIGRFAM ID[TIGR00804] nucleoside transporter 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.570754 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAACCC TCATGGGGCT TGTCGGCGTG ATCGCCGTCC TACTGATCGC CTTCGCCCTA 
TCCGAGAACC GACGCGCCAT TCGCTTGCGC ACCGTCGGCG GCGCCCTCGC CATTCAAGCC
GGCCTGGGCC TGTTCGTGTT GTACATCCCC TTCGGTCACA GCGTGCTGCT GGGCCTGACG
CACGGCGTGC AATCCGTCAT CGACAGTGCC CAGGCAGGCA TGGACTTCGT CTTCGGCGGG
CTGGTCAGCG ATACCATGTT CGAGGTCTTC GGCGATGGCG GTTTCGTCTT CGCCCTGCGC
GTCTTGCCGA TCATCGTGTT CTTTTCCTCG CTGGTCGCCG TGCTGTATTA CCTGGGCATC
ATGCGCTGGG TCATCAAGCT GATCGGTGGC GCGTTGCGCA AGCTGCTGGG CACCTCGCGC
ACCGAGTCGC TCTCCGCCGC GGCCAATATC TTCGTCGGCC AGACCGAGGC TCCGATGGCC
GTGCGCCCCT TCCTGTCGCG CATGACGCGC TCCGAGCTGT TCGCGGTGAT GGTCGGTGGT
CTGGCCTCGG TCGCCGGCAG CGTGCTGGGC GGTTACGCCG CCATGGGCAT CTCCCTCGAG
TACCTGCTGG CCGCCTCGTT CATGGCGGCG CCCGGAGGGT TGTTGATGGC CAAGATGCTG
GTGCCGGAAA CCGAGCGTCA GGCAGCCGAC CTGGAAGAGG TCATCGACGG CGAAGACGAA
CAGGCGCCCG CCAACGTCAT CGAAGCCGCG GCCAACGGCG CGGCTTCCGG CGTGCAACTG
GCCATCAACG TCGGCGGCAT GCTGCTCGCG TTCATCGCGC TGATCGCGCT GGGCAACCAG
ATCATCGGCG GCATCGGCGG CTGGTTCGGC TATCCGGAAC TGACCCTCGA ACTGCTGCTC
GGCTACCTGT TCGCACCGCT GGCGTTCTTG ATCGGCGTGC CCTGGGCGGA AGCCATTCAA
GCCGGCAGTT TCCTCGGCCA GAAGCTGGTG CTCAACGAGT TCGTCGCCTT CGCCAATCTC
GCGGCCGATC CCCAGGCGCT GAGCGCCCAT TCCAAGGCGA TCGTGATCTT CGCCCTGTGC
GGTTTCGCCA ACCTCTCGTC CATCGCCATT CTCATGGGGG GACTCGGCAT GATGGCGCCC
AACCGGCGCA GCGACATCGC CAGGATGGGG CTCAAGGCGG TTGCCGCCGG CACGCTCTCC
AATCTGATGA GTGCCGCCCT GGCAGGGCTG TTCCTCTCGC TCTGA
 
Protein sequence
MQTLMGLVGV IAVLLIAFAL SENRRAIRLR TVGGALAIQA GLGLFVLYIP FGHSVLLGLT 
HGVQSVIDSA QAGMDFVFGG LVSDTMFEVF GDGGFVFALR VLPIIVFFSS LVAVLYYLGI
MRWVIKLIGG ALRKLLGTSR TESLSAAANI FVGQTEAPMA VRPFLSRMTR SELFAVMVGG
LASVAGSVLG GYAAMGISLE YLLAASFMAA PGGLLMAKML VPETERQAAD LEEVIDGEDE
QAPANVIEAA ANGAASGVQL AINVGGMLLA FIALIALGNQ IIGGIGGWFG YPELTLELLL
GYLFAPLAFL IGVPWAEAIQ AGSFLGQKLV LNEFVAFANL AADPQALSAH SKAIVIFALC
GFANLSSIAI LMGGLGMMAP NRRSDIARMG LKAVAAGTLS NLMSAALAGL FLSL