Gene GSU2383 details


Gene Information

Locus tag: GSU2383
Symbol: trpE
ID: 2686590
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 2611347
End bp: 2612822
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 67%
IMG OID: 637127073
Product: anthranilate synthase component I
Protein accession: NP_953429
Protein GI: 39997478
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
            [TIGR01820] anthranilate synthase component I, archaeal clade
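
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2611347
end_bp = 2612822
gene_length_bp = 1476
protein_length_aa = 491

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop
# codon, which is not translated into an amino acid.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print(span_bp, codon_count, expected_protein_length)
```

Under these assumptions the span equals the listed Gene Length, and the codon count minus one equals the listed Protein Length.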


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTATTTCC CCGATGTGGA TACCTTTCGC GCCCTGGGCG CCCGGGGTAA CCTGATTCCC 
GTCTGCCGCG AGATCATGGC GGATATGGAC ACCCCGGTCA GTGCCTTCCG CAAGCTCGAC
GACGGCCGCT ATGCCTTCCT CCTGGAGAGT ATCGAGGGGG GCGAAAAGTG GGCCCGCTAT
ACCTTCCTCG GCGCCAGCCC CTCCACGGTC ATCCGGAGCC GGGGCACCAC CGTCGAGATC
ATCACCAACG GCGAGACCCG TTCCGTGACG ACGGACGACC CCCTCGGCTT TGTCCGCGAT
TTCCTGGCCC GGTTCCGGCC CGTGGAGATC CCCGGGCTTC CCCGCTTCTT CGGCGGCGCC
GTGGGGTACC TCGGCTACGA CATGGTCCGC CAATTCGAGC GGCTTCCCAC GGACAAACCC
GCCGTGATCG GCGCCTGGGA CTCCTGTTTC CTCATCACCG ACACCATCGT GATTTTCGAC
AATATGCGCC AGAAGATCAC GGTGGTCTCC AATGCCCACC TGGACGAGGG CGTTTCCGTC
GAAGCGGCCT ATGCCGACGC CGTTGCCCGG ATCGACGGGA TCATTGCCCG GCTCAAGGCG
CCGCTGCCGG CCCAGCCCGC CGCGGCCGCG GCCCGGAAGG TCTCCTTTTC TTCCAACATC
ACCCGGGAGG CGTTCGAGGA TGCCGTGGAA CGGGCCAAGG AGTACGTCCG GGCCGGCGAC
ATCATCCAGG TGGTCCTGTC CCAGCGTTTT TCCGGCGAGC TCACCGTGGA CCCTCTCGAC
ATCTACCGGG TATTGCGGAC CCTGAACCCG TCGCCCTACA TGTTTTTCCT CCGCCTGGAC
GATACCCTGG TGGTGGGCGC CTCTCCCGAA GTCATGGTGC GTCGGGAGGG GAACCGGGTG
GAGCTCCGCC CCATCGCCGG CACCCGCCCC CGGGGCGCCA CGCCGGAACA GGACGAGCAA
CTGGCGGAGG AACTCCTGGC CGACCCCAAG GAGCGGGCCG AGCACGTGAT GCTCGTGGAC
CTGGGGCGCA ACGACCTGGG GCGGGTCTGC CGCACCGGCA CCGTGAAGGT GTCGGAGCTC
ATGGTGATCG AGCGCTACTC CCACGTGATG CACATCGTTT CCAACGTCCA GGGCGAACTG
GCCGAGGGGA GGGACGCCTT CGACGTGGTG CGGGCCACCT TCCCGGCCGG CACCCTCTCC
GGCGCCCCCA AGGTGCGGGC CATGGAGATC ATCGACGAGC TGGAGCCGGT GCGCCGGGAA
GTCTACGGCG GCGCCGTGGG CTACTTCTCC TTCTCCGGCA CCATGGACCT GGCCATCGCC
ATCCGCACTC TCGTCATCCG CGACGGCGTG GTGCACCTCC AGGCCGGGGC CGGCATCGTG
GCCGACTCTG ATCCGGCCTC CGAGTACCAG GAGACGGTCA ACAAGGCCAT GGCCGTGGTA
AAGGCCATCG AGACCGCGGA AAAGGGGTTG GACTGA
 
Protein sequence
MYFPDVDTFR ALGARGNLIP VCREIMADMD TPVSAFRKLD DGRYAFLLES IEGGEKWARY 
TFLGASPSTV IRSRGTTVEI ITNGETRSVT TDDPLGFVRD FLARFRPVEI PGLPRFFGGA
VGYLGYDMVR QFERLPTDKP AVIGAWDSCF LITDTIVIFD NMRQKITVVS NAHLDEGVSV
EAAYADAVAR IDGIIARLKA PLPAQPAAAA ARKVSFSSNI TREAFEDAVE RAKEYVRAGD
IIQVVLSQRF SGELTVDPLD IYRVLRTLNP SPYMFFLRLD DTLVVGASPE VMVRREGNRV
ELRPIAGTRP RGATPEQDEQ LAEELLADPK ERAEHVMLVD LGRNDLGRVC RTGTVKVSEL
MVIERYSHVM HIVSNVQGEL AEGRDAFDVV RATFPAGTLS GAPKVRAMEI IDELEPVRRE
VYGGAVGYFS FSGTMDLAIA IRTLVIRDGV VHLQAGAGIV ADSDPASEYQ ETVNKAMAVV
KAIETAEKGL D
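
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates that relationship on the first and last 10-base-grouped lines of the gene sequence only. The function name `translate_cds` and the set `ALT_STARTS` are hypothetical helpers introduced for this example, and `ALT_STARTS` lists only a subset of the alternative start codons that table 11 permits.

```python
from itertools import product

# Codon table for translation table 11, written in the conventional T, C, A, G
# base order (third base varies fastest). Elongation codons in table 11 are read
# the same way as in the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption for this sketch: only these alternative start codons are handled.
ALT_STARTS = {"GTG", "TTG"}


def translate_cds(dna: str, is_gene_start: bool = False) -> str:
    """Translate a DNA fragment codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    If is_gene_start is True, an alternative start codon in first position is
    read as methionine.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and is_gene_start and codon in ALT_STARTS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
first_line = "GTGTATTTCC CCGATGTGGA TACCTTTCGC GCCCTGGGCG CCCGGGGTAA CCTGATTCCC"
last_line = "AAGGCCATCG AGACCGCGGA AAAGGGGTTG GACTGA"

print(translate_cds(first_line, is_gene_start=True))
print(translate_cds(last_line))
```

The two fragments translate to the first 20 and last 11 residues of the protein sequence listed above, with the trailing TGA read as the stop codon. The last line can be translated on its own only because the preceding 1440 bases are a whole number of codons.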