Gene B21_00002 details


Gene Information

Locus tag: B21_00002
Symbol: thrA
ID: 8112805
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 336
End bp: 2798
Gene Length: 2463 bp
Protein Length: 820 aa
Translation table: 11
GC content: 53%
IMG OID: 644846297
Product: hypothetical protein
Protein accession: YP_002997870
Protein GI: 251783566
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0527] Aspartokinases
TIGRFAM ID: [TIGR00657] aspartate kinase
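
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the reported 2463 bp agrees with that reading) and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(336, 2798)
print(length, protein_length(length))  # 2463 820
```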


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.964701
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAGTGT TGAAGTTCGG CGGTACATCA GTGGCAAATG CAGAACGTTT TCTGCGGGTT 
GCCGATATTC TGGAAAGCAA TGCCAGGCAG GGGCAGGTGG CCACCGTCCT CTCTGCCCCC
GCCAAAATCA CCAACCACCT GGTGGCGATG ATTGAAAAAA CCATTAGCGG CCAGGATGCT
TTACCCAATA TCAGCGATGC CGAACGTATT TTTGCCGAAC TTTTGACGGG ACTCGCCGCC
GCCCAGCCGG GATTCCCGCT GGCGCAATTG AAAACTTTCG TCGATCAGGA ATTTGCCCAA
ATAAAACATG TCCTGCATGG CATTAGTTTG TTGGGGCAGT GCCCGGATAG CATCAACGCT
GCGCTGATTT GCCGTGGCGA GAAAATGTCG ATCGCCATTA TGGCCGGCGT ATTAGAAGCG
CGCGGTCACA ACGTTACCGT TATCGATCCG GTCGAAAAAC TGCTGGCAGT GGGGCATTAC
CTCGAATCTA CCGTCGATAT TGCTGAGTCC ACCCGCCGTA TTGCGGCAAG TCGCATTCCG
GCTGATCACA TGGTGCTGAT GGCAGGTTTC ACCGCCGGTA ATGAAAAAGG CGAACTGGTG
GTACTTGGAC GCAACGGTTC CGACTACTCC GCGGCGGTGC TGGCTGCCTG TTTACGCGCC
GATTGTTGCG AGATTTGGAC GGACGTTGAC GGGGTCTATA CCTGCGACCC GCGTCAGGTG
CCCGATGCGA GGTTGTTGAA GTCGATGTCC TACCAGGAAG CGATGGAGCT TTCCTACTTC
GGCGCTAAAG TTCTTCACCC CCGCACCATT ACCCCCATCG CCCAGTTCCA GATCCCTTGC
CTGATTAAAA ATACCGGAAA TCCTCAAGCT CCAGGTACGC TCATTGGTGC CAGCCGTGAT
GAAGACGAAT TACCGGTCAA GGGCATTTCC AATCTGAATA ATATGGCAAT GTTCAGCGTT
TCCGGCCCGG GGATGAAAGG GATGGTTGGC ATGGCGGCGC GCGTGTTTGC AGCGATGTCA
CGCGCCCGTA TTTCCGTGGT GCTGATTACG CAATCATCTT CCGAATACAG TATCAGTTTC
TGCGTTCCGC AAAGCGACTG TGTGCGAGCT GAACGGGCAA TGCAGGAAGA GTTCTACCTG
GAACTGAAAG AAGGCTTACT GGAGCCGCTG GCGGTGACGG AACGGCTGGC CATTATCTCG
GTGGTAGGTG ATGGTATGCG CACCTTGCGT GGGATCTCGG CGAAATTCTT TGCCGCGCTG
GCCCGCGCCA ATATCAACAT TGTCGCCATT GCTCAGGGAT CTTCTGAACG CTCAATCTCT
GTCGTGGTAA ATAACGATGA TGCGACCACT GGCGTGCGCG TTACTCATCA GATGCTGTTC
AATACCGATC AGGTTATCGA AGTGTTTGTG ATTGGCGTCG GTGGCGTTGG CGGTGCGCTG
CTGGAGCAAC TGAAGCGTCA ACAAAGCTGG CTGAAGAATA AACATATCGA CTTACGTGTC
TGCGGTGTTG CCAACTCGAA GGCACTGCTC ACCAATGTGC ATGGCCTAAA TCTGGAAAAC
TGGCAGGAAG AACTGGCGCA AGCCAAAGAG CCGTTTAATC TCGGGCGCTT AATTCGCCTC
GTGAAAGAAT ATCATCTGCT GAACCCGGTC ATTGTTGACT GCACTTCCAG CCAGGCAGTG
GCGGATCAAT ATGCCGACTT CTTGCGCGAA GGTTTCCACG TTGTCACGCC GAACAAAAAG
GCCAACACCT CGTCGATGGA TTACTACCAT CTGTTGCGTC ATGCGGCGGA AAAATCGCGG
CGTAAATTCC TCTATGACAC CAACGTTGGG GCTGGATTAC CGGTTATTGA GAACCTGCAA
AATCTGCTCA ATGCTGGTGA TGAATTGATG AAGTTCTCCG GCATTCTTTC AGGTTCGCTT
TCTTATATCT TCGGCAAGTT AGACGAAGGC ATGAGTTTCT CCGAGGCGAC TACTCTGGCG
CGGGAAATGG GTTATACCGA ACCGGATCCG CGAGATGATC TTTCTGGTAT GGATGTAGCG
CGTAAGCTAT TGATTCTCGC TCGTGAAACG GGACGTGAAC TGGAGCTGGC GGATATTGAA
ATTGAACCTG TGCTGCCCGC AGAGTTTAAC GCTGAGGGTG ATGTTGCCGC TTTTATGGCG
AATCTGTCAC AGCTCGACGA TCTCTTTGCC GCGCGCGTGG CGAAGGCCCG TGATGAAGGA
AAAGTTTTGC GCTATGTTGG CAATATTGAT GAAGATGGTG CCTGCCGCGT GAAGATTGCC
GAAGTGGATG GTAATGATCC GCTGTTCAAA GTGAAAAATG GCGAAAACGC CCTGGCCTTT
TATAGCCACT ATTATCAGCC GCTGCCGTTG GTGCTGCGCG GATATGGTGC GGGCAATGAC
GTTACAGCTG CCGGTGTCTT TGCCGATCTG CTACGTACCC TCTCATGGAA GTTAGGAGTC
TGA
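
The GC content field in the gene information can in principle be recomputed from the gene sequence above. This is a minimal sketch that assumes the sequence is pasted exactly as shown, in whitespace-separated blocks of 10 bases; `gc_percent` is an illustrative helper, not something the database provides.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring the spaces and newlines between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence: 10 of the 20 bases are G or C.
print(gc_percent("ATGCGAGTGT TGAAGTTCGG"))  # 50.0
```

Passing the full 2463-base sequence instead of the two-block excerpt gives the value for the whole gene, which can then be compared with the rounded figure in the record.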
 
Protein sequence
MRVLKFGGTS VANAERFLRV ADILESNARQ GQVATVLSAP AKITNHLVAM IEKTISGQDA 
LPNISDAERI FAELLTGLAA AQPGFPLAQL KTFVDQEFAQ IKHVLHGISL LGQCPDSINA
ALICRGEKMS IAIMAGVLEA RGHNVTVIDP VEKLLAVGHY LESTVDIAES TRRIAASRIP
ADHMVLMAGF TAGNEKGELV VLGRNGSDYS AAVLAACLRA DCCEIWTDVD GVYTCDPRQV
PDARLLKSMS YQEAMELSYF GAKVLHPRTI TPIAQFQIPC LIKNTGNPQA PGTLIGASRD
EDELPVKGIS NLNNMAMFSV SGPGMKGMVG MAARVFAAMS RARISVVLIT QSSSEYSISF
CVPQSDCVRA ERAMQEEFYL ELKEGLLEPL AVTERLAIIS VVGDGMRTLR GISAKFFAAL
ARANINIVAI AQGSSERSIS VVVNNDDATT GVRVTHQMLF NTDQVIEVFV IGVGGVGGAL
LEQLKRQQSW LKNKHIDLRV CGVANSKALL TNVHGLNLEN WQEELAQAKE PFNLGRLIRL
VKEYHLLNPV IVDCTSSQAV ADQYADFLRE GFHVVTPNKK ANTSSMDYYH LLRHAAEKSR
RKFLYDTNVG AGLPVIENLQ NLLNAGDELM KFSGILSGSL SYIFGKLDEG MSFSEATTLA
REMGYTEPDP RDDLSGMDVA RKLLILARET GRELELADIE IEPVLPAEFN AEGDVAAFMA
NLSQLDDLFA ARVAKARDEG KVLRYVGNID EDGACRVKIA EVDGNDPLFK VKNGENALAF
YSHYYQPLPL VLRGYGAGND VTAAGVFADL LRTLSWKLGV
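
The gene and protein sequences above can be cross-checked by translating the DNA with translation table 11, as listed in the gene information. The following is a hedged sketch in plain Python: the codon table is built from the standard NCBI amino acid ordering for table 11, and `clean` and `translate` are illustrative names. It does not handle the table's alternative start codons (for example GTG or TTG read as Met at initiation), which is not an issue here because this gene begins with ATG.

```python
from itertools import product

# NCBI ordering: first, second, and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(sequence: str) -> str:
    """Remove the spaces and newlines that separate the 10-base blocks."""
    return "".join(sequence.split()).upper()


def translate(sequence: str, to_stop: bool = True) -> str:
    """Translate DNA codon by codon; unknown codons become 'X'."""
    dna = clean(sequence)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCGAGTGT TGAAGTTCGG CGGTACATCA"))  # MRVLKFGGTS
```

Applying `translate` to the full gene sequence and comparing the result with `clean` applied to the protein sequence is one way to confirm that the two listings agree.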