Gene SO_3019 details

Gene Information

Locus tag: SO_3019
Symbol: trpE
ID: 1170706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella oneidensis MR-1
Kingdom: Bacteria
Replicon accession: NC_004347
Strand:
Start bp: 3129858
End bp: 3131582
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 50%
IMG OID: 637344828
Product: anthranilate synthase component I
Protein accession: NP_718587
Protein GI: 24374544
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00565] anthranilate synthase component I, proteobacterial subset
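
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the usual convention of 1-based, inclusive coordinates and a single untranslated stop codon at the end of the CDS. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 3129858
end_bp = 3131582
gene_length_bp = 1725
protein_length_aa = 574

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not contribute a residue.
codons = span // 3
expected_protein_length = codons - 1

print("span (bp):", span)
print("span divisible by 3:", span % 3 == 0)
print("expected protein length (aa):", expected_protein_length)
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.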


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCTAA AGACATTTAA TCAGGTTACC CAAGCCGATA GAGCGAATTT AGCCTCGTCT 
CAACAGACAT TCGCACGTTC ACATACGCTC AAAGCCACCC TGGTATACCA TAGCGATCCA
CTGCGTCTGT ACCAGCACAT CACTGAAGAT GCGCCCCATA CCATGTTGTT AGAATCAGCG
GAAATCAACA GTAAGGAAAA TCTTAAAAGC ATGGTGATGA CCCATGCGGC ACTGATGATC
CGCTGCGACG GTTATCGTTT ACGCTTTAGC GCACTGACCG ACAATGGTGC CAGTTTGCTC
ACGCCCATTG AGCAATTTTT TATGGCGCGT TCATGCCATA CACAATGCCA ACGCAATGGC
CAACACTTAG TGGTGACGCT GCAAAAAGAC ACTGAGCTTA AGGATGAAGA TGCGCGCTTA
AAATCCACCT CGCCCCTCGA TGGTTTGCGC TTGTTTGTTA AACATATCGA CTGCGGCGCT
CATACTGACA GCCAATCCAA GCCAGCGTTC GAAGACTTAT TTTTAGGTGG CGTGCTTTCC
TACGATTTGA TTGATACCGT CGAGCCGCTG CCAGAAGCCC CGAATGGTGC AAATGATTGT
CCTGATTATT TATTTTATCT CGCCGAAACC TTAATTCTTA TCGATCACAA ACAAAAACAA
GCCGAGATTA TCACCCACAA CTTCAGTGAA AGCGCAGAAC AACATTCAGA GGTGACCCAA
GCCTTAGCCG AGCGAGTTGA AAACATCCGC GCCCAATGTG AAGCCTTAGC CAAGAGTGCA
ACGCCTGCGC CTGCCCTCGT TGGCATAACA GCCACAGAGC AAGTGAATGT CAGTGATGAG
GCCTTTAAAC AAACTGTTAT CGATTTAAAA GAACACATTA TTGCGGGCGA TATCTTCCAA
GTGGTGCCTT CTCGCAGTTT TAGCCTGCCC TGCCCGAATA CCTTAGGTGC TTACCGCGCG
CTGCGTCTAA CCAATCCTAG CCCCTATATG TTTTATTTCA GGGGAAATGA TTTCACCCTG
TTTGGCGCCT CGCCAGAAAG CGCGCTGAAA TTTGATTCCA GCAACAATCA GGTCGAAGTC
TATCCAATCG CAGGTACCCG TAAACGCGGC AAAACCGCCA GTGGCGAGAT TGATTTCGAC
CTCGATAGCC GCATCGAACT CGAACTGCGT TTAGATAAAA AAGAGTTATC TGAACATTTA
ATGCTGGTCG ATTTGGCTCG CAACGATATC GCCCGAATCA GCCAAAGCGG CAGTCGTAAA
GTGGCTGAGT TACTTAAAGT CGACCGCTAC TCCCACGTCA TGCACCTTGT GAGCCGCGTC
ACCGGCCAAT TACGCCAAGA CTTAGATGCG CTCCACGCCT ACCAAGCCTG TATGAACATG
GGCACTTTAG TTGGCGCGCC CAAAGTTCGT GCTTCGCAAT TAGTGCGTCA GGCAGAAAAG
ACCCGCCGAG GCAGCTATGG CGGCGCCGTG GGCTACCTCA ATGCCCTTGG CGATATGGAT
ACCTGCATTG TGATCCGCTC CGCTTTTGTT AAAGACGGTG TAGCCCATAT CCAAGCCGGT
GCAGGAGTGG TGTTTGACTC CGATCCACAA AGTGAAGCCG ATGAAACCCG CCAAAAGGCG
CAAGCGGTGA TTTCGGCCAT CAAAATGGGC GCAGGTTTAG CAGGCATAAA TAACTGCAAC
GAGCACACTT CCACAAAAGT CTCAACAGCA GCGCAGCAAG GATAA
 
Protein sequence
MTLKTFNQVT QADRANLASS QQTFARSHTL KATLVYHSDP LRLYQHITED APHTMLLESA 
EINSKENLKS MVMTHAALMI RCDGYRLRFS ALTDNGASLL TPIEQFFMAR SCHTQCQRNG
QHLVVTLQKD TELKDEDARL KSTSPLDGLR LFVKHIDCGA HTDSQSKPAF EDLFLGGVLS
YDLIDTVEPL PEAPNGANDC PDYLFYLAET LILIDHKQKQ AEIITHNFSE SAEQHSEVTQ
ALAERVENIR AQCEALAKSA TPAPALVGIT ATEQVNVSDE AFKQTVIDLK EHIIAGDIFQ
VVPSRSFSLP CPNTLGAYRA LRLTNPSPYM FYFRGNDFTL FGASPESALK FDSSNNQVEV
YPIAGTRKRG KTASGEIDFD LDSRIELELR LDKKELSEHL MLVDLARNDI ARISQSGSRK
VAELLKVDRY SHVMHLVSRV TGQLRQDLDA LHAYQACMNM GTLVGAPKVR ASQLVRQAEK
TRRGSYGGAV GYLNALGDMD TCIVIRSAFV KDGVAHIQAG AGVVFDSDPQ SEADETRQKA
QAVISAIKMG AGLAGINNCN EHTSTKVSTA AQQG
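
The gene and protein sequences above are printed in space-separated blocks of ten characters, so any downstream use has to strip whitespace first. The following is a minimal sketch of how the GC content and the translation could be recomputed from the gene sequence. It assumes the sense codons of translation table 11 match the standard genetic code (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical and not part of any database API.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a DNA string."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate in frame from the first base and stop at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with their original spacing.
first_block = "ATGACCCTAA AGACATTTAA TCAGGTTACC"
print(translate(first_block))  # MTLKTFNQVT, the first ten residues of the protein
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the protein sequence and the GC content field listed in the record.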