Gene CPS_4056 details


Gene Information

Locus tag: CPS_4056
Symbol:
ID: 3521694
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Colwellia psychrerythraea 34H
Kingdom: Bacteria
Replicon accession: NC_003910
Strand:
Start bp: 4267261
End bp: 4268808
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table: 11
GC content: 47%
IMG OID: 637286500
Product: NCS1 nucleoside transporter
Protein accession: YP_270712
Protein GI: 71281223
COG category: [F] Nucleotide transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG1953] Cytosine/uracil/thiamine/allantoin permeases
TIGRFAM ID: [TIGR00800] NCS1 nucleoside transporter family
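
The Start bp, End bp, Gene Length and Protein Length values in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that Gene Length includes one terminal stop codon; neither convention is stated on this page, and the function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length_bp = gene_length(4267261, 4268808)
print(length_bp)                  # 1548
print(protein_length(length_bp))  # 515
```

Under these assumptions, 1548 bp is 516 codons, and dropping the stop codon leaves the 515 aa reported above.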


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGATGA GCAACAGTAA TTTAAATTCT TCGTCAGTCG TTCAGGTTGG CGAATTTTAT 
GAGCTAGAAG TGGGCAAAGA TGTCCAGGAT AGTGATCACT ATAATGAAGA TATGGCCCCG
ACGAGTGTGA AAGACAGGAC TTGGAATACC TGGAATGTCG CGGCGTTGTG GGTGGGCATG
GCAATTTGTG TGCCGACCTA TACCCTAGGG GGGGTCTTGA CGGCTTATTT CGGTTTGAGT
GTAACTGAAG CGCTGATCAC CATATTACTG GCAAATATCG TAGTATTGAT TCCCTTGACC
CTTAATGCGT ATCCTGGTAC AAAATATGGC ATTCCTTTCC CGGTATTATT ACGTTCTTCC
TTTGGTATCA AAGGCTCCAA TATTCCCTGT ATGATCCGAG CGCTAGTCGC CTGTGGCTGG
TTTGGGATCC AAACCATGTT TGGCGGCGTC GCTATTCATA TTTTGATGTC AAATCTGTTT
GAGTCCTGGG CGACCTTGGG AGGAACGGGT GAGGTTTTTG GCTTCTTTAC TTTTCTCGCT
ATTAACCTGT TTATCGTTAT CAAGGGCTCC GAATCAATTA AGATCCTTGA AACCGTTGCC
GCACCGCTGT TGCTTGCGGT AGGTATTGGT CTGATGATGT GGGCCTATCC ACAGATCTCT
GTGACTGAAA TTCTAGCCAC ACCCGCTAAC CGTCCTGAAG GTGCATCATT CTGGGGTTAC
TTCTTTGGCG GGTTAACTGC GATGGTTGGT TTTTGGGCAA CCTTGTCGCT GAACATTCCG
GATTTTAGTC GTTATGTTAA GTCACAGAAA TCACAAATCG CAGGCCAAAT CATAGGCCTG
CCAGCTACCA TGTTCTTCTT TTCCGCTTTA GGGGTAGTCT TGACTGCCGC TTCAACGACC
TTGGTTGGTG AAACTATCTC TGATCCAATT AACCTTATTG GCAAAATCGA CAGCCCAGTA
TGGGTGGTGA TTGCAATGGT GATGATTATT ATCGCGACGC TGTCGACTAA TACTGCCGCC
AATGTTGTGT CACCTACCAA TGATTTTCAG AACCTAGCAC CCAAGAAGAT TAGTCAAACA
CGCGGTGTAT TACTGACTGG TCTGCTGGGT GTGTTACTGA TGAGCTGGGA GCTGCTTAAA
AAACTGGGTT GGATTGAGTC TGATGTTAGT GTTGAAGCCA TGTACACCGG CTGGTTATTG
GGCTACTCCA GTTTGCTGGG GCCGATAGCC GGGATCATGG TGGTCGATTA CTTTATTATC
AAAAAACAGC GTCTGGAATT GGCAGAGCTC TATAAATCAG AGGGTATTTA CGGTGGCTTC
AATAAAGCTG GACTGCTTGC TTTCGGAATC CCTGTCACCT TGACGCTCAT TGCGATAACT
ACAGGCATGT TCTCCTGGTT TTATCAATTC GGTTGGTTCA CTGGGTCTAT TATGGGTGGT
GTGGTGTATT TCATCGCCGC CAGCAAACAG CAAGCAGAAA GTAGTGTTGG GATCCAGCAA
ACAGCGTCGG ATGCCAGCGA ACTGAAAAGC CTGAGAAACA ACGCATAA
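
GC content is the percentage of G and C bases in a sequence. The sketch below shows one way to compute it from text laid out like the gene sequence above, assuming the spaces and line breaks are display formatting only. `clean_sequence` and `gc_percent` are hypothetical helper names, and the page does not say how its own GC content figure was computed or rounded.

```python
def clean_sequence(text):
    """Remove display whitespace (spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text):
    """Percentage of G and C bases in a whitespace-formatted DNA sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as formatted on the page.
first_line = "ATGAAGATGA GCAACAGTAA TTTAAATTCT TCGTCAGTCG TTCAGGTTGG CGAATTTTAT"
print(len(clean_sequence(first_line)))      # 60
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence block to `gc_percent` gives the whole-gene value for comparison with the reported figure.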
 
Protein sequence
MKMSNSNLNS SSVVQVGEFY ELEVGKDVQD SDHYNEDMAP TSVKDRTWNT WNVAALWVGM 
AICVPTYTLG GVLTAYFGLS VTEALITILL ANIVVLIPLT LNAYPGTKYG IPFPVLLRSS
FGIKGSNIPC MIRALVACGW FGIQTMFGGV AIHILMSNLF ESWATLGGTG EVFGFFTFLA
INLFIVIKGS ESIKILETVA APLLLAVGIG LMMWAYPQIS VTEILATPAN RPEGASFWGY
FFGGLTAMVG FWATLSLNIP DFSRYVKSQK SQIAGQIIGL PATMFFFSAL GVVLTAASTT
LVGETISDPI NLIGKIDSPV WVVIAMVMII IATLSTNTAA NVVSPTNDFQ NLAPKKISQT
RGVLLTGLLG VLLMSWELLK KLGWIESDVS VEAMYTGWLL GYSSLLGPIA GIMVVDYFII
KKQRLELAEL YKSEGIYGGF NKAGLLAFGI PVTLTLIAIT TGMFSWFYQF GWFTGSIMGG
VVYFIAASKQ QAESSVGIQQ TASDASELKS LRNNA
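
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11, whose codon-to-amino-acid assignments are the same as the standard genetic code. The sketch below is a minimal illustration, assuming the gene sequence is written 5' to 3' in coding orientation as shown on this page. `CODON_TABLE` and `translate` are illustrative names, not part of any database tool.

```python
from itertools import product

# Compact standard layout: codons enumerated over T, C, A, G with the third
# base varying fastest; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and line breaks
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGAAGATGA GCAACAGTAA TTTAAATTCT"))  # MKMSNSNLNS
print(translate("CTGAGAAACA ACGCATAA"))               # LRNNA
```

Table 11 also permits alternative start codons such as GTG and TTG, which are read as methionine at the initiating position. This sketch does not handle that case, and it is not needed here because the gene starts with ATG.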