Gene B21_03659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03659 
Symbolybl183 
ID8116286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3907327 
End bp3908550 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content51% 
IMG OID644849820 
Producthypothetical protein 
Protein accessionYP_003001393 
Protein GI251787089 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1457] Purine-cytosine permease and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTAAAA AAGAAGAGAA TCTGAATACG GCATCAGGAT TGCGTATTGC CATGATTTTG 
CTGGGTATTG CCGTCACACC TGTGCTGTTG TCATCTTCAA GCCTCGGCAA TCAACTTTCC
AGCAGCAGTT TAATTAGCGT CGTATTGTTA GGCGGCGTCA TTCTGACCTT ACTTTCAGCC
ATCACCATTA GCGTGGGAGA AAAAGCCCGC CTGCCAACGT ATGGCATTGT GAAATATTCG
TTTGGCGAAA AAGGGGCCAT CGCCATTAAC ATTTTGATGG CGATAAGTCT GTTCGGCTGG
ATTGCCGTTA CCGCCAATAT GTTTGGTCAT TCGGTACATG ACTTACTGGC TCAACATGGA
CTGGAAGTTC CACTGGCACT GTTAGTGGCG GCTGGCTGTG TCATTTTTGT CGCCTCTACG
GCATTTGGCT TTACCGTTCT GGGAAAAATT GCCCAGGTTG CCGTGCCGGT TATCGCGCTG
GTGCTGTGTT ACATCCTCTA TGTGGCAACC CATACCGAAG TGGCAGTACC AGCGGCGATT
GTGGAGATGA ATACAGGTGT CGCCGTTTCC ACCGTTGTTG GCACCATTAT TGTGCTGGTT
GCCACACTGC CTGATTTCGG TAGTTTTGTG CATAACCGCA AACATGCGCT GATTGCCGCA
GGCGTGACGT TTCTGGTTGC CTACCCTCTG CTCTACTGGG CGGGTGCAAC GCCGAGCGCC
ATTAGTGGTC AGGGATCTTT ACTGGGTGCG ATGGCGGTAT TCGGTGCGGT TCTGCCTGCG
GCGCTGTTGT TGATTTTCGC CTGCGTCACC GGTAACGCGG GCAATATGTT CCAGGGCACG
CTGGTGGTTT CCACACTGCT TACCCGCTTT CCCAAATGGC AGATTACCGT GGCGCTGGGT
ATCCTTTCCG CCATCGTAGG CAGTATGGAT ATTATGGCGT GGTTTATTCC GTTTCTGCTG
TTCCTGGGTA TCGCCACGCC ACCCGTTGCC GGAATTTATA TCGCTGACTT TTTCCTTTAT
CGCCGTAATG GCTATCAAGA GTCAGTGTTA GCCCAGGAGT CACAGATTAA AGTGCTGACA
TTCGCAGCAT GGATCATAGG CGCAGCGGTT GGCTTTATGA CCGTAAAAGG CTTATTCACC
CTGACGACGA TCCCTTCGGT AGACTCGATT CTGGTGGCAT GTATCGCGTA TGCGATTCTC
AGTCGGGCAA GTCAACACCG CTAA
 
Protein sequence
MRKKEENLNT ASGLRIAMIL LGIAVTPVLL SSSSLGNQLS SSSLISVVLL GGVILTLLSA 
ITISVGEKAR LPTYGIVKYS FGEKGAIAIN ILMAISLFGW IAVTANMFGH SVHDLLAQHG
LEVPLALLVA AGCVIFVAST AFGFTVLGKI AQVAVPVIAL VLCYILYVAT HTEVAVPAAI
VEMNTGVAVS TVVGTIIVLV ATLPDFGSFV HNRKHALIAA GVTFLVAYPL LYWAGATPSA
ISGQGSLLGA MAVFGAVLPA ALLLIFACVT GNAGNMFQGT LVVSTLLTRF PKWQITVALG
ILSAIVGSMD IMAWFIPFLL FLGIATPPVA GIYIADFFLY RRNGYQESVL AQESQIKVLT
FAAWIIGAAV GFMTVKGLFT LTTIPSVDSI LVACIAYAIL SRASQHR