Gene Phep_4170 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4170 
Symbol 
ID8255305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp5042563 
End bp5043975 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content42% 
IMG OID644937835 
ProductAnthranilate synthase 
Protein accessionYP_003094423 
Protein GI255534051 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000134842 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000986926 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATGCATA TGAGTAATTA CATAATTAAA ACAACTTACA AGAAAAGACT GGCAGATACG 
ACCACACCGG TGAGTATTTA TTTGCGCCTT CGGGATGTAT TCCCCAATAC AATTTTGCTG
GAAAGTTCTG ACTACCATAG TCGTTCCAAT TCGGTAAGCT ATGTTTGTGC CGAGCCCATA
GCCGGTATTG TTTTGCAGGA TGGCCTGCTT TCAACCTATT TTCCCGATGG TAAAAAAGAG
GAAAAGAAGG ACTTTACGCT TACGGCAGAA ATTGATGCCT TTAAAGCAGC ATTTAAGCCG
GATGTAGTGG ATGATACAAG ATACATATCC AGCGGCTTGT TTGGGTATTT TACCTGGAAT
GCCGTACAGG AGTTTGAAGA CATTAAATTT ACTGCAAAAT CGCCAAAGGA TCAGGAAATT
CCAATAATGC AATACCATAT TTACCGGTAT ATTATTGCAA TAGACCACTT TAAAAATGAG
GTCACCCTGT TTAAAAATAC CTTTAACGGG GATGATAATG AAGATCTTGA AAAGATAGAA
TACATCATTC AGAACAAAAA TTTTCCGGAA TACAGCTTTA AAACGGATGG AGAAGAGCAT
TCGAACCTGA GTAATGAGGG TTTTATGGAA ATGGTAGAGC AGATGAAGAA GCACATTTTA
CGCGGCGATG TATTCCAGAT TGTGCCTTCC AGGGCCTACA ATCAGGGCTT CCTGGGGGAT
GAATTTAATG TTTACCGTTG CTTAAGGTCT ATTAATCCAT CACCTTACCT GTTCTATTTC
GATTATGGGA GTTTTAAACT TTTTGGTTCT TCTCCGGAAG CCCAGATCAC CATAAAAGAT
AATGTAGCCA ATATTTTCCC GATTGCAGGT ACGTTTAAGC GGACGGGAAA TGACGAGGAG
GATGCAGAAC TGGCGCGGAA ACTGGAACAG GACCCTAAAG AAAGTGCTGA ACATGTGATG
CTGGTAGATC TGGCCAGGAA TGACCTGAGC AGGCATTGCA GCGGGGTTGA GGTGAAATCA
TTTAAAGAAG TTCAATACTA TTCTCACCTG ATCCACCTGG TGTCTAAGGT AAGTGGTAAC
CTGCAGCCCA ATGTATCGGC CTTTAAGGTA GTTGCCGATA CTTATCCGGC GGGAACATTA
AGCGGAGCCC CTAAATACCG GGCCATGCAA CTGATTGATG AGTATGAAGG TTTGGGACGT
AATTTTTATG CCGGGGCCAT TGGATACATG GGCTTTAACG ACCAGTTTAA CCATGCCATT
ATGATCAGGA CTTTTATGAG TAAAAATAAC CAGTTGTATT ACAGGGCAGG GGCAGGTATA
GTGGCCGATT CTGTTGCTGT AAACGAACTG AATGAGGTAA ATAACAAAAT TGCGGCCTTG
CGCAAGGCCA TACAGATGGC TACAGATATT TAA
 
Protein sequence
MMHMSNYIIK TTYKKRLADT TTPVSIYLRL RDVFPNTILL ESSDYHSRSN SVSYVCAEPI 
AGIVLQDGLL STYFPDGKKE EKKDFTLTAE IDAFKAAFKP DVVDDTRYIS SGLFGYFTWN
AVQEFEDIKF TAKSPKDQEI PIMQYHIYRY IIAIDHFKNE VTLFKNTFNG DDNEDLEKIE
YIIQNKNFPE YSFKTDGEEH SNLSNEGFME MVEQMKKHIL RGDVFQIVPS RAYNQGFLGD
EFNVYRCLRS INPSPYLFYF DYGSFKLFGS SPEAQITIKD NVANIFPIAG TFKRTGNDEE
DAELARKLEQ DPKESAEHVM LVDLARNDLS RHCSGVEVKS FKEVQYYSHL IHLVSKVSGN
LQPNVSAFKV VADTYPAGTL SGAPKYRAMQ LIDEYEGLGR NFYAGAIGYM GFNDQFNHAI
MIRTFMSKNN QLYYRAGAGI VADSVAVNEL NEVNNKIAAL RKAIQMATDI