Gene B21_03722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03722 
SymbolyiiD 
ID8114563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3982115 
End bp3983104 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content55% 
IMG OID644849883 
Producthypothetical protein 
Protein accessionYP_003001456 
Protein GI251787152 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1246] N-acetylglutamate synthase and related acetyltransferases 
TIGRFAM ID[TIGR02447] thioesterase domain, putative 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.645417 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGC TTCCAGGGTT GTCACGGGAA ACAAGAGAGA GTATCGCTAT GTATCACCTT 
CGGGTTCCAC AAACAGAAGA AGAATTAGAG CGTTACTATC AGTTTCGCTG GGAAATGTTG
CGTAAGCCCC TGCATCAACC AAAAGGTTCG GAACGCGACG CGTGGGATGC GATGGCGCAT
CACCAGATGG TCGTCGACGA GCAGGGTAAT CTGGTGGCGG TAGGCCGACT GTATATTAAT
GCCGACAATG AAGCGTCCAT TCGCTTTATG GCCGTTCATC CCGACGTGCA GGACAAAGGG
TTAGGCACGC TGATGGCGAT GACCCTGGAG TCGGTGGCGC GTCAGGAAGG CGTTAAGCGC
GTGACCTGTA GCGCCCGTGA AGACGCGGTG GAGTTTTTCG CCAAGCTGGG GTTTGTTAAT
CAGGGAGAAA TCACCACGCC AACCACCACG CCGATTCGCC ATTTTTTGAT GATTAAGCCC
GTCGCCACTC TGGATGACAT TCTGCATCGC GGCGACTGGT GCGCGCAGCT GCAACAGGCG
TGGTACGAAC ATATCCCGCT TAGTGAAAAA ATGGGCGTGC GCATTCAGCA ATATACCGGG
CAAAAATTTA TCACTACCAT GCCAGAAACC GGCAATCAGA ATCCGCACCA TACGCTGTTT
GCCGGGAGTT TATTCTCACT GGCGACGCTC ACCGGTTGGG GACTTATCTG GCTGATGCTG
CGTGAACGCC ACCTCGGCGG AACGATTATT CTTGCGGATG CGCATATCCG CTACAGCAAG
CCGATTAGCG GTAAACCTCA TGCGGTAGCC GACCTCGGTG CCTTAAGCGG CGATCTCGAC
CGTCTGGCGC GCGGACGAAA AGCACGGGTG CAGATGCAGG TCGAAATCTT TGGCGACGAG
ACGCCGGGTG CAGTGTTTGA AGGCACGTAT ATCGTTCTGC CCGCGAAGCC ATTTGGCCCG
TATGAAGAGG GCGGGAACGA AGAAGAGTAG
 
Protein sequence
MSQLPGLSRE TRESIAMYHL RVPQTEEELE RYYQFRWEML RKPLHQPKGS ERDAWDAMAH 
HQMVVDEQGN LVAVGRLYIN ADNEASIRFM AVHPDVQDKG LGTLMAMTLE SVARQEGVKR
VTCSAREDAV EFFAKLGFVN QGEITTPTTT PIRHFLMIKP VATLDDILHR GDWCAQLQQA
WYEHIPLSEK MGVRIQQYTG QKFITTMPET GNQNPHHTLF AGSLFSLATL TGWGLIWLML
RERHLGGTII LADAHIRYSK PISGKPHAVA DLGALSGDLD RLARGRKARV QMQVEIFGDE
TPGAVFEGTY IVLPAKPFGP YEEGGNEEE