Gene Dshi_1796 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1796 
SymboltrpE 
ID5712784 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1867111 
End bp1868640 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content69% 
IMG OID641267716 
Productanthranilate synthase component I 
Protein accessionYP_001533139 
Protein GI159044345 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00033121 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.176001 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCTGA CCCCCAGTTT CGACGCCTTC GAGGCCGCCC ATGGGCGCGG CGAGAACCAG 
GTGGTCTACA CCCGGCTGGC GGCGGATCTG GACACGCCCG TGTCGCTGAT GCTGAAGCTG
GCGGGCGCGC GCAAGGACAG CTTCATGCTG GAATCCGTCA CCGGCGGCGA GGTGCGCGGG
CGCTACTCGG TAGTGGGGCT GAAGCCCGAT CTGATCTGGG AGTGTCGCGG CACCGGTGCG
CGGATCAACC GCTCCGCCCG GTTCGATGCG GAAGCGTTCG AAGACATCCC CGGCGACCCG
CTGGCCGCAT TGCGCGCGTT GATCGCCGAG AGCCGGATCG AGATGCCCGA TGAGCTGCCC
GCGATCGCGG CGGGGCTGTT CGGCTATCTG GGCTATGACA TGATCCGGTT GGTGGAGCAT
CTGCCGAATG TGAACCCCGA CCCGCTGGGC CTGCCGGACG CGGTGCTGGT GCGGCCCTCG
GTGGTGGCGG TTCTGGACGG GGTGAAGGGC GAGGTCACGG TGGTCTCGCC GGTCTGGGCA
AACGGTTCGG CGGGGGGCGG CCAATCGGCG CGGGCCGCTT ATGCCCAGGC GGCGGAGCGG
GTGATGGACG CGGTGCGCGA TCTGGAACGC TCCGCCATGG CCGAACGGCG CGATCTGGGC
GAGGCGGCCG AGCTCGGAGA GCCGGTCTCG AACTTCGCCC ATGCCGATTA CCTGGCCGCG
GTGGAGCGCG CCAAGGACTA CATCCGCGCC GGGGACATCT TCCAGGTGGT GCCGTCCCAG
CGTTGGTCCC AGGCCTTCCC CCTGCCGCCC TTCGCGCTTT ACCGCAGCTT GCGGCGAACA
AACCCCTCGC CCTTCATGTT CTATTTCAAC TTCGGCGGGT TCCAGATCGT GGGCGCCAGC
CCCGAGATCC TGGTGCGGGT GTTCGGCCGC GAGGTGACGA TCCGGCCGAT CGCGGGCACC
CGGCCACGGG GTGCGACACC GGCCGAGGAT GACGCGCTGG AGGCGGATCT GCTGGCAGAT
GCCAAGGAAT GCGCCGAGCA CCTGATGCTG CTCGACCTGG GGCGCAACGA TGTGGGCCGG
GTCGCCAAGA TCGGCACCGT GCGCCCGACC GAACAGTTCA TCATCGAGCG TTATTCCCAC
GTGATGCATA TCGTGTCGAA CGTGGTGGGC GAACTGAGCG AGGAGCATGA CGCGCTCTCG
GCGCTGCTGG CGGGGTTGCC GGCGGGCACG GTCTCGGGCG CGCCCAAGGT GCGGGCGATG
GAGATCATCG ACGAGCTGGA GCCGGAAAAG CGCGGGGTCT ATGGCGGGGG CTGCGGGTAT
TTCGCGGCCA ATGGCGACAT GGACATGTGC ATCGCCCTGC GCACGGCGGT GGTCAAGGAC
GAGACGCTCT ATATCCAGGC CGGGGGCGGC GTGGTCTATG ACAGCGACCC GGAGGCGGAG
TTCCAGGAGA CGGTGAACAA GGCCAAGGCG ATCCGGATGG CGGCGCAGCA GGCCGGGTTG
TTCGCGGGGC CCGCCGGGCG GAACGGCTGA
 
Protein sequence
MALTPSFDAF EAAHGRGENQ VVYTRLAADL DTPVSLMLKL AGARKDSFML ESVTGGEVRG 
RYSVVGLKPD LIWECRGTGA RINRSARFDA EAFEDIPGDP LAALRALIAE SRIEMPDELP
AIAAGLFGYL GYDMIRLVEH LPNVNPDPLG LPDAVLVRPS VVAVLDGVKG EVTVVSPVWA
NGSAGGGQSA RAAYAQAAER VMDAVRDLER SAMAERRDLG EAAELGEPVS NFAHADYLAA
VERAKDYIRA GDIFQVVPSQ RWSQAFPLPP FALYRSLRRT NPSPFMFYFN FGGFQIVGAS
PEILVRVFGR EVTIRPIAGT RPRGATPAED DALEADLLAD AKECAEHLML LDLGRNDVGR
VAKIGTVRPT EQFIIERYSH VMHIVSNVVG ELSEEHDALS ALLAGLPAGT VSGAPKVRAM
EIIDELEPEK RGVYGGGCGY FAANGDMDMC IALRTAVVKD ETLYIQAGGG VVYDSDPEAE
FQETVNKAKA IRMAAQQAGL FAGPAGRNG