Gene B21_00294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00294 
SymbolcodB 
ID8113470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp324355 
End bp325614 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content54% 
IMG OID644846581 
Producthypothetical protein 
Protein accessionYP_002998154 
Protein GI251783850 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1457] Purine-cytosine permease and related proteins 
TIGRFAM ID[TIGR00800] NCS1 nucleoside transporter family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCGCAAG ATAACAACTT TAGCCAGGGG CCAGTCCCGC AGTCGGCGCG GAAAGGGGTA 
TTGGCATTGA CGTTCGTCAT GCTGGGATTA ACCTTCTTTT CCGCCAGTAT GTGGACCGGC
GGCACTCTCG GAACCGGTCT TAGCTATCAT GATTTCTTCC TCGCAGTTCT CATCGGTAAT
CTTCTCCTCG GTATTTACAC TTCATTTCTC GGTTACATTG GCGCAAAAAC CGGCCTGACC
ACTCATCTTC TTGCTCGCTT CTCGTTTGGT GTTAAAGGCT CATGGCTGCC TTCACTGCTA
CTGGGCGGAA CTCAGGTTGG CTGGTTTGGC GTCGGTGTGG CGATGTTTGC CATTCCGGTG
GGTAAGGCAA CCGGGCTGGA TATTAATTTG CTGATTGCCG TTTCCGGTTT ACTGATGACC
GTCACCGTCT TTTTTGGCAT TTCGGCGCTG ACGGTTCTTT CGGTGATTGC GGTTCCGGCT
ATCGCCTGCC TGGGCGGTTA TTCCGTGTGG CTGGCTGTTA ACGGCATGGG CGGCCTGGAC
GCATTAAAAG CGGTCGTTCC CGCACAACCG TTAGATTTCA ATGTCGCGCT GGCGCTGGTT
GTGGGGTCAT TTATCAGTGC GGGTACGCTC ACCGCTGACT TTGTCCGGTT TGGTCGCAAT
GCCAAACTGG CGGTGCTGGT GGCGATGGTG GCCTTTTTCC TCGGCAACTC GTTGATGTTT
ATTTTCGGTG CAGCGGGCGC TGCGGCACTG GGCATGGCGG ATATCTCTGA TGTGATGATT
GCTCAGGGCC TGCTGCTGCC TGCGATTGTG GTGCTGGGGC TGAATATCTG GACCACCAAC
GATAACGCAC TCTATGCGTC GGGTTTAGGT TTCGCCAACA TTACCGGGAT GTCGAGCAAA
ACCCTTTCGG TAATCAACGG TATTATCGGT ACGGTCTGCG CATTATGGCT GTATAACAAT
TTTGTCGGCT GGTTGACCTT CCTTTCGGCA GCTATTCCTC CAGTGGGTGG CGTGATCATC
GCCGACTATC TGATGAACCG TCGCCGCTAT GAGCACTTTG CGACCACGCG TATGATGAGT
GTCAATTGGG TGGCGATTCT GGCGGTCGCC TTGGGGATTG CTGCAGGCCA CTGGTTACCG
GGAATTGTTC CGGTCAACGC GGTATTAGGT GGCGCGCTGA GCTATCTGAT CCTTAACCCG
ATTTTGAATC GTAAAACGAC AGCAGCAATG ACGCATGTGG AGGCTAACAG TGTCGAATAA
 
Protein sequence
MSQDNNFSQG PVPQSARKGV LALTFVMLGL TFFSASMWTG GTLGTGLSYH DFFLAVLIGN 
LLLGIYTSFL GYIGAKTGLT THLLARFSFG VKGSWLPSLL LGGTQVGWFG VGVAMFAIPV
GKATGLDINL LIAVSGLLMT VTVFFGISAL TVLSVIAVPA IACLGGYSVW LAVNGMGGLD
ALKAVVPAQP LDFNVALALV VGSFISAGTL TADFVRFGRN AKLAVLVAMV AFFLGNSLMF
IFGAAGAAAL GMADISDVMI AQGLLLPAIV VLGLNIWTTN DNALYASGLG FANITGMSSK
TLSVINGIIG TVCALWLYNN FVGWLTFLSA AIPPVGGVII ADYLMNRRRY EHFATTRMMS
VNWVAILAVA LGIAAGHWLP GIVPVNAVLG GALSYLILNP ILNRKTTAAM THVEANSVE