Gene B21_02668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02668 
SymbolygeZ 
ID8113996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2840098 
End bp2841483 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content53% 
IMG OID644848864 
Producthypothetical protein 
Protein accessionYP_003000437 
Protein GI251786133 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR02033] D-hydantoinase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGTAT TGATCAAAAA CGGCACTGTC GTTAACGCAG ATGGACAAGC CAAACAGGAT 
TTGCTGATTG AAAGCGGGAT TGTTCGCCAG TTGGGCAACA ATATTTCGCC GCAGCTCCCG
TATGAAGAAA TTGATGCCAC TGGCTGTTAC GTTTTCCCTG GCGGCGTGGA TGTCCATACG
CATTTCAATA TTGATGTCGG CATCGCGCGC AGTTGTGATG ATTTTTTTAC CGGTACCCGC
GCAGCTGCGT GTGGCGGTAC AACAACCATT ATTGACCATA TGGGATTTGG CCCAAACGGC
TGTCGGTTAC GCCATCAACT GGAGGTTTAT CGTGGTTATG CCGCCCATAA AGCGGTCATC
GATTACAGCT TTCACGGTGT GATCCAGCAC ATTAATCACG CAATCCTCGA CGAAATCCCG
ATGATGGTCG AGGAAGGACT GAGCAGTTTT AAACTCTATT TAACCTATCA ATACAAACTC
AACGATGACG AGGTTTTGCA GGCATTACGC CGTCTGCATG AATCCGGCGC GCTGACCACC
GTGCACCCGG AAAATGATGC GGCTATCGCC AGCAAGCGGG CGGAGTTTAT CGCCGCAGGG
TTAACCGCGC CGCGCTATCA CGCCTTGAGT CGCCCTCTGG AATGCGAAGC GGAAGCCATC
GCCCGCATGA TTAACCTGGC ACAAATTGCC GGTAACGCCC CGCTCTATAT CGTGCACCTG
TCTAACGGCT TAGGTCTGGA TTATCTGCGT CTTGCCCGTG CGAATCACCA GCCAGTCTGG
GTTGAAACCT GCCCACAATA TCTCCTGTTG GACGAACGCA GTTACGATAC AGAAGATGGC
ATGAAGTTCA TTCTTAGCCC ACCGCTGCGT AACGTACGCG AGCAGGACAA ACTGTGGTGT
GGCATCAGCG ATGGTGCGAT TGACGTGGTG GCAACCGATC ACTGCACCTT CTCGATGGCT
CAACGCCTGC AAATTTCTAA AGGCGATTTC AGTCGCTGCC CAAATGGCTT ACCCGGTGTG
GAAAACCGCA TGCAGTTACT GTTTTCCAGT GGCGTGATGA CGGGACGTAT AACACCGGAA
CGCTTTGTTG AATTAACCAG CGCAATGCCC GCCAGGCTGT TTGGCCTGTG GCCGCAAAAA
GGATTATTAG CGCCCGGTTC CGACGGCGAC GTGGTGATTA TCGACCCACG TCAGAGCCAA
CAAATTCAGC ATCGCCATCT CCACGACAAC GCCGACTACT CGCCATGGGA GGGTTTTACC
TGTCAGGGCG CGATTGTCAG AACCTTATCC CGTGGTGAAA CGATTTTCTG TGACGGCACC
TTTACAGGCA AAGCCGGGCG AGGTCGTTTC CTGCGACGCA AACCGTTTGT CCCTCCCGTG
CTCTAA
 
Protein sequence
MRVLIKNGTV VNADGQAKQD LLIESGIVRQ LGNNISPQLP YEEIDATGCY VFPGGVDVHT 
HFNIDVGIAR SCDDFFTGTR AAACGGTTTI IDHMGFGPNG CRLRHQLEVY RGYAAHKAVI
DYSFHGVIQH INHAILDEIP MMVEEGLSSF KLYLTYQYKL NDDEVLQALR RLHESGALTT
VHPENDAAIA SKRAEFIAAG LTAPRYHALS RPLECEAEAI ARMINLAQIA GNAPLYIVHL
SNGLGLDYLR LARANHQPVW VETCPQYLLL DERSYDTEDG MKFILSPPLR NVREQDKLWC
GISDGAIDVV ATDHCTFSMA QRLQISKGDF SRCPNGLPGV ENRMQLLFSS GVMTGRITPE
RFVELTSAMP ARLFGLWPQK GLLAPGSDGD VVIIDPRQSQ QIQHRHLHDN ADYSPWEGFT
CQGAIVRTLS RGETIFCDGT FTGKAGRGRF LRRKPFVPPV L