Gene EcHS_A0401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0401 
SymbolcodB 
ID5595154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp420425 
End bp421684 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content54% 
IMG OID640919586 
Productcytosine permease 
Protein accessionYP_001457171 
Protein GI157159853 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1457] Purine-cytosine permease and related proteins 
TIGRFAM ID[TIGR00800] NCS1 nucleoside transporter family 


Plasmid Coverage information

Num covering plasmid clones70 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCGCAAG ATAACAACTT TAGCCAGGGG CCAGTCCCGC AGTCGGCGCG GAAAGGGGTA 
TTGGCATTGA CGTTCGTCAT GCTGGGATTA ACCTTCTTTT CCGCCAGTAT GTGGACCGGC
GGCACTCTCG GAACCGGTCT TAGCTATCAT GATTTCTTCC TCGCAGTTCT CATCGGTAAT
CTTCTCCTCG GTATTTACAC TTCATTTCTC GGTTACATTG GCGCAAAAAC CGGCCTGACC
ACTCATCTTC TTGCTCGCTT CTCGTTTGGT GTTAAAGGCT CATGGCTGCC TTCACTGCTA
CTGGGCGGAA CTCAGGTTGG CTGGTTTGGC GTTGGCGTGG CGATGTTTGC TATTCCGGTG
GGCAAGGCAA CCGGGCTGGA TATTAATTTG CTGATTGCCG TTTCCGGTTT ACTGATGACC
GTCACCGTCT TCTTTGGCAT TTCGGCGCTG ACGGTTCTTT CGGTGATTGC GGTTCCGGCT
ATTGCCTGCC TTGGCGGTTA TTCCGTGTGG CTGGCCGTTA ACGGCATGGG CGGCCTGGAC
GCCTTAAAAG CGGTCGTTCC CGCACAACCG TTAGATTTCA ATGTCGCGCT GGCGCTGGTT
GTGGGGTCAT TTATCAGTGC GGGTACGCTC ACCGCTGACT TTGTCCGGTT TGGTCGCAAT
GCCAAACTGG CGGTGCTGGT GGCGATGGTG GCCTTTTTCC TCGGCAACTC GTTGATGTTT
ATTTTCGGTG CAGCGGGCGC TGCGGCACTG GGGATGGCAG ATATCTCTGA TGTGATGATT
GCTCAGGGAC TGCTGCTGCC CGCGATTGTG GTGCTGGGGC TGAATATCTG GACCACCAAC
GATAACGCAC TCTATGCGTC GGGTTTAGGT TTCGCCAACA TTACCGGTAT GTCGAGCAAA
ACCCTTTCGG TAATCAACGG TATTATCGGT ACGGTCTGCG CATTATGGCT GTATAACAAT
TTTGTCGGCT GGTTGACCTT CCTTTCGGCA GCTATTCCTC CGGTAGGGGG CGTGATCATC
GCCGACTATC TGATGAACCG TCGCCGCTAT GAGCACTTTG CGACCACGCG TATGATGAGT
GTCAATTGGG TGGCGATTCT GGCGGTCGCG CTGGGGATTG CCGCAGGCCA CTGGTTACCG
GGAATTGTTC CGGTCAACGC GGTATTAGGT GGCGCGCTGA GCTATCTGAT CCTTAACCCG
ATTTTGAATC GTAAAACGAC AGCAGCAATG ACGCATGTGG AGGTTAACAG TGTCGAATAA
 
Protein sequence
MSQDNNFSQG PVPQSARKGV LALTFVMLGL TFFSASMWTG GTLGTGLSYH DFFLAVLIGN 
LLLGIYTSFL GYIGAKTGLT THLLARFSFG VKGSWLPSLL LGGTQVGWFG VGVAMFAIPV
GKATGLDINL LIAVSGLLMT VTVFFGISAL TVLSVIAVPA IACLGGYSVW LAVNGMGGLD
ALKAVVPAQP LDFNVALALV VGSFISAGTL TADFVRFGRN AKLAVLVAMV AFFLGNSLMF
IFGAAGAAAL GMADISDVMI AQGLLLPAIV VLGLNIWTTN DNALYASGLG FANITGMSSK
TLSVINGIIG TVCALWLYNN FVGWLTFLSA AIPPVGGVII ADYLMNRRRY EHFATTRMMS
VNWVAILAVA LGIAAGHWLP GIVPVNAVLG GALSYLILNP ILNRKTTAAM THVEVNSVE