Gene B21_03661 details

Gene Information

Locus tag: B21_03661
Symbol: ybl185
ID: 8116288
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 3908982
End bp: 3910082
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 55%
IMG OID: 644849822
Product: hypothetical protein
Protein accession: YP_003001395
Protein GI: 251787091
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0549] Carbamate kinase
TIGRFAM ID: [TIGR00746] carbamate kinase
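
The reported lengths can be cross-checked against the coordinates. The sketch below is illustrative only and is not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon excluded from the protein length. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 3908982
end_bp = 3910082

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1101 366
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields.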


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GCATTTTACC GGTCATCAAT ACGGGTATCG CCCACAAACA AGCGGGGGTC GGGCAAATTG 
GTGCCGGGAT CACCACTGCG CCGATGGCCT GTTTTGTCGC CGCTGTACGT GCACTGGCGG
AGATCGTCGC AAAGGAGAAC CATCATGGTT AAGCCACTGG CTGTCGTCGC GGTTGGCGGC
AATGCGCTCA TTCAGGACGA GCAACGCAAT AGTATTCCCG ATCAATATGT TGCAGTGATG
GAAAGCGTGC AACATATCGT TGATATGGTT GAAGCCGGAT GGGACCTGGT ACTAACCCAC
GGTAATGGCC CGCAGGTGGG CTTTATTCTG CGCCGCTCTG AACTCGCCAG TAACGAAGTT
TCTCCGGTTC CACTTGATTA CGCCGTGGGT GATACACAAG GTGCAATTGG CTACATGTTC
CAGAAAGCGC TGCATAACGA ATTGGCTCGC CGTGGCATAA ACAAACCGGT AATTGCCCTG
GTGACACAAA CGCGAGTCAG CCCACATGAC GATGCTTTCG CCAGCCCCAG TAAACCAATT
GGCGCGTTTC TCGATGAAGC AACAGCCCAA CAACGCCAAC AACAACTCGG CTGGACGCTG
ATGGAGGACG CCGGGCGTGG TTGGCGGCGT ACAGTTCCCT CTCCTGCACC ACTGGAAATT
ATTGAGCACG ACACCATCGC TCACCTGGTG CGCCAGGGAT ATCTGGTTAT TGCCTGCGGC
GGCGGCGGTA TTCCGGTGGT GCGAGACGGG CAACAACTGA AAGGTGTGGA AGCCGTGATC
GATAAAGATC TGGCCTCCGC GCTGCTCGCC AGTCAGTTAG GCGCAGATCT GCTGGTGATC
CCCACCGGTG TAGAAAAAGT AGCGATTAAC TTTGGTACAC CACAACAACA GTGGCTCGAC
GCTATCAGCG TTGCCGAAGC GCAAACGCTG TTGCGGGAAG GTCAGTTTGG TGTCGGCAGT
ATGCAACCCA AAGTGGAAGC CATTGTTGAT TTCATCAATG CCAGCCAGCA ACAAGGCAAA
CAGGCCAGCG GCCTGATTAC TTCACCGCAA ACCATAAAAG CAGCCCTGGC GCATCAGAGC
GGCACATGGA TAACCCTTTA A
 
Protein sequence
AFYRSSIRVS PTNKRGSGKL VPGSPLRRWP VLSPLYVHWR RSSQRRTIMV KPLAVVAVGG 
NALIQDEQRN SIPDQYVAVM ESVQHIVDMV EAGWDLVLTH GNGPQVGFIL RRSELASNEV
SPVPLDYAVG DTQGAIGYMF QKALHNELAR RGINKPVIAL VTQTRVSPHD DAFASPSKPI
GAFLDEATAQ QRQQQLGWTL MEDAGRGWRR TVPSPAPLEI IEHDTIAHLV RQGYLVIACG
GGGIPVVRDG QQLKGVEAVI DKDLASALLA SQLGADLLVI PTGVEKVAIN FGTPQQQWLD
AISVAEAQTL LREGQFGVGS MQPKVEAIVD FINASQQQGK QASGLITSPQ TIKAALAHQS
GTWITL
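
The relationship between the two sequence blocks can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. The helper names (`clean`, `translate`, `gc_content`) are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs in which codons may act as start codons, and this sketch does not model start-codon handling.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and the last line of the gene sequence above.
print(translate("GCATTTTACC GGTCATCAAT ACGGGTATCG"))  # AFYRSSIRVS
print(translate("GGCACATGGA TAACCCTTTA A"))           # GTWITL*
```

The first call reproduces the first ten residues of the protein sequence, and the second reproduces its last six residues followed by the stop codon. Passing the full gene sequence to `gc_content` would give a value to compare with the GC content field.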