Gene B21_00939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00939 
SymbolaspC 
ID8116196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp989020 
End bp990210 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content52% 
IMG OID644847201 
Producthypothetical protein 
Protein accessionYP_002998774 
Protein GI251784470 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1448] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00958049 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGAGA ACATTACCGC CGCTCCTGCC GACCCGATTC TGGGCCTGGC CGATCTGTTT 
CGTGCCGATG AACGTCCCGG CAAAATTAAC CTCGGGATTG GTGTCTATAA AGATGAGACG
GGCAAAACCC CGGTACTGAC CAGCGTGAAA AAGGCTGAAC AGTATCTGCT CGAAAATGAA
ACCACCAAAA ATTACCTCGG CATTGACGGC ATCCCTGAAT TTGGTCGCTG CACTCAGGAA
CTGCTGTTTG GTAAAGGTAG CGCCCTGATC AATGACAAAC GTGCTCGCAC GGCACAGACT
CCGGGGGGCA CTGGCGCACT ACGCGTGGCT GCCGATTTCC TGGCAAAAAA TACCAGCGTT
AAGCGTGTGT GGGTGAGCAA CCCAAGCTGG CCGAACCATA AGAGCGTCTT TAACTCTGCA
GGTCTGGAAG TTCGTGAATA CGCTTATTAT GATGCGGAAA ATCACACTCT TGACTTCGAT
GCACTGATTA ACAGCCTGAA TGAAGCTCAG GCTGGCGACG TAGTGCTGTT CCATGGCTGC
TGCCATAACC CAACCGGTAT CGACCCTACG CTGGAACAAT GGCAAACACT GGCACAACTC
TCCGTTGAGA AAGGCTGGTT ACCGCTGTTT GACTTCGCTT ACCAGGGTTT TGCCCGTGGT
CTGGAAGAAG ATGCTGAAGG ACTGCGCGCT TTCGCGGCTA TGCATAAAGA GCTGATTGTT
GCCAGTTCCT ACTCTAAAAA CTTTGGCCTG TACAACGAGC GTGTTGGCGC TTGTACTCTG
GTTGCTGCCG ACAGTGAAAC CGTTGATCGC GCATTCAGCC AAATGAAAGC GGCGATTCGC
GCTAACTACT CTAACCCACC AGCACACGGC GCTTCTGTTG TTGCCACCAT CCTGAGCAAC
GATGCGTTAC GTGCGATTTG GGAACAAGAG CTGACTGATA TGCGCCAGCG TATTCAGCGT
ATGCGTCAGT TGTTCGTCAA TACGCTGCAG GAAAAAGGCG CAAACCGCGA CTTCAGCTTT
ATCATCAAAC AGAACGGCAT GTTCTCCTTC AGTGGCCTGA CAAAAGAACA AGTGCTGCGT
CTGCGCGAAG AGTTTGGCGT ATATGCGGTT GCTTCTGGTC GCGTAAATGT GGCCGGGATG
ACACCAGATA ACATGGCTCC GCTGTGCGAA GCGATTGTGG CAGTGCTGTA A
 
Protein sequence
MFENITAAPA DPILGLADLF RADERPGKIN LGIGVYKDET GKTPVLTSVK KAEQYLLENE 
TTKNYLGIDG IPEFGRCTQE LLFGKGSALI NDKRARTAQT PGGTGALRVA ADFLAKNTSV
KRVWVSNPSW PNHKSVFNSA GLEVREYAYY DAENHTLDFD ALINSLNEAQ AGDVVLFHGC
CHNPTGIDPT LEQWQTLAQL SVEKGWLPLF DFAYQGFARG LEEDAEGLRA FAAMHKELIV
ASSYSKNFGL YNERVGACTL VAADSETVDR AFSQMKAAIR ANYSNPPAHG ASVVATILSN
DALRAIWEQE LTDMRQRIQR MRQLFVNTLQ EKGANRDFSF IIKQNGMFSF SGLTKEQVLR
LREEFGVYAV ASGRVNVAGM TPDNMAPLCE AIVAVL