Gene SbBS512_E3389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E3389 
SymbolansB 
ID6272479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3152905 
End bp3153951 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content52% 
IMG OID641727280 
ProductL-asparaginase II 
Protein accessionYP_001881730 
Protein GI187732586 
COG category[E] Amino acid transport and metabolism
[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0252] L-asparaginase/archaeal Glu-tRNAGln amidotransferase subunit D 
TIGRFAM ID[TIGR00520] L-asparaginases, type II 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGTTTT TCAAAAAGAC GGCACTTGCC GCACTGGTTA TGGGTTTTAG TGGTGCAGCA 
TTGGCATTAC CCAATATCAC CATTTTAGCA ACCGGCGGGA CCATTGCCGG TGGTGGTGAC
TCCGCAACCA AATCTAACTA CACAGCGGGT AAAGTTGGCG TAGAAAATCT GGTTAATGCG
GTGCCGCAAC TAAAAGACAT TGCGAACGTT AAAGGCGAGC AGGTAGTGAA TATCGGCTCC
CAGGACATGA ACGATAATGT CTGGCTGACA CTGGCGAAAA AAATTAACAC CGACTGCGAT
AAAACCGACG GCTTCGTCAT TACCCACGGT ACCGACACGA TGGAAGAAAC CGCTTACTTC
CTCGACCTGA CGGTGAAATG CGACAAACCG GTGGTGATGG TCGGCGCAAT GCGCCCGTCC
ACGTCCATGA GCGCAGACGG TCCATTCAAC CTGTATAACG CGGTAGTGAC CGCAGCTGAT
AAAGCATCCG CTAATCGTGG CGTGCTGGTG GTGATGAACG ACACTGTACT GGACGGTCGC
GATGTAACCA AAACCAACAC CACCGACGTA GCGACCTTCA AGTCTGTTAA CTACGGTCCT
CTGGGATACA TTCACAACGG TAAGATTGAC TACCAACGTA CCCCGGCACG TAAGCACACC
AGCGATACGC CATTCGATGT CTCTAAGCTG AATGAGCTGC CGAAAGTCGG CATCGTTTAT
AACTACGCTA ACGCATCCGA TCTTCCGGCT AAAGCACTGG TAGATGCGGG CTATGATGGC
ATCGTTAGCG CTGGTGTGGG TAATGGTAAC CTGTATAAAT CCGTGTTCGA CACCCTGGCA
ACCGCCGCGA AAAACGGCAC TGCAGTAGTG CGTTCTTCCC GCGTACCGAC GGGTGCTACC
ACTCAGGATG CTGAAGTGGA TGATGCGAAA TACGGCTTCG TCGCCTCTGG CACGCTGAAC
CCGCAAAAAG CGCGCGTCCT GCTGCAGCTG GCTCTGACGC AAACCAAAGA TCCGCAGCAG
ATCCAGCAGA TCTTCAATCA GTACTAA
 
Protein sequence
MEFFKKTALA ALVMGFSGAA LALPNITILA TGGTIAGGGD SATKSNYTAG KVGVENLVNA 
VPQLKDIANV KGEQVVNIGS QDMNDNVWLT LAKKINTDCD KTDGFVITHG TDTMEETAYF
LDLTVKCDKP VVMVGAMRPS TSMSADGPFN LYNAVVTAAD KASANRGVLV VMNDTVLDGR
DVTKTNTTDV ATFKSVNYGP LGYIHNGKID YQRTPARKHT SDTPFDVSKL NELPKVGIVY
NYANASDLPA KALVDAGYDG IVSAGVGNGN LYKSVFDTLA TAAKNGTAVV RSSRVPTGAT
TQDAEVDDAK YGFVASGTLN PQKARVLLQL ALTQTKDPQQ IQQIFNQY