Gene Phep_2072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2072 
Symbol 
ID8253176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2390647 
End bp2391792 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content40% 
IMG OID644935720 
Productaminotransferase class I and II 
Protein accessionYP_003092339 
Protein GI255531967 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.235336 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGCAA TACAATCAAA ATTACCACAC ACCGGCACTA CTATCTTTAC AGTAATGTCA 
AAATTAGCTG AAGAACACAA TGCAATTAAT CTTTCTCAGG GTTTTCCTGA CTACGATTGC
GATCCAAAGT TATTAAGTTT TGTTACAGAG GCCATGCAAA AGGGCTTTAA TCAATATGCC
CCAATGCCGG GTTTGCCTGC ACTGAGGGAA TTAATTGCAG AAAAGGTGAG TAACCTGTAT
GGTGCAAATT ATCATCCGGA CACCGAAATA ACCATTACTG CAGGTGGTAC CCAGGCTATT
TTTACTGCCC TGAGCTCCTG TATCAGTTCG GGCGATGAGG TTATTATTTT TGAACCTGCT
TATGATTGTT ACGCCCCAAC GGTTAAATTA TTAGGGGGAC TGGTTAAACC TTATGAACTG
GCACCCCCAA ATTACGAAAT TGACTGGGAA ATGGTAAAGA AACTCTTTAC CGCAAACACC
AAAATGATCA TTCTGAACAG CCCGCAAAAC CCGACAGGCT GTATTTTATC AGAAAAGGAT
ATTAAAGCAC TGATTAAACT GACCAAAAAC ACAGACATCC TCATCTTAAG TGATGAGGTA
TATGAACACA TCATATTTGA CAACAACAAA CACCAAAGCA TTGCCTTATA TCCCGAATTA
AGGGAAAGAA GTTTTATTGC GGCTTCTTTT GGCAAATTAC TGCATGCAAC CGGATGGAAA
TTAGGTTATT GCCTGGCCCC CGAAAAACTG ATGAAGGAAT TTAGAAAAGT ACATCAGTTT
AACGTATTCA GTGTAAATAC GCCCATGCAG CTGGGCATTG CGAATTACCT GAAAGATGCC
GGAACATATA TGGGTTTATC TTCTTTTTTT CAGCAGAAAC GCGATTTCTT CCGGAAACTG
CTGGCAGAAA CCAATTTTAA TTTATTACCT TGTAACGGTT CTTATTTTCA ATGTGTTAGT
TACGGGCACC TAAGCGAAGA AAAGGATACA GACATGGCTA TAAGGCTGGT TAAGAATTAT
GGTGTGGCCA GTATCCCCGT CTCGGCCTTT TATACCAGGA ACCCGGACCA CCAGGTATTG
CGGTTTTGTT TTGCCAAAAA ACAAGAAACA TTGGAAAAAG CCGTTGAAAG ATTAATGAAA
TTATAA
 
Protein sequence
MIAIQSKLPH TGTTIFTVMS KLAEEHNAIN LSQGFPDYDC DPKLLSFVTE AMQKGFNQYA 
PMPGLPALRE LIAEKVSNLY GANYHPDTEI TITAGGTQAI FTALSSCISS GDEVIIFEPA
YDCYAPTVKL LGGLVKPYEL APPNYEIDWE MVKKLFTANT KMIILNSPQN PTGCILSEKD
IKALIKLTKN TDILILSDEV YEHIIFDNNK HQSIALYPEL RERSFIAASF GKLLHATGWK
LGYCLAPEKL MKEFRKVHQF NVFSVNTPMQ LGIANYLKDA GTYMGLSSFF QQKRDFFRKL
LAETNFNLLP CNGSYFQCVS YGHLSEEKDT DMAIRLVKNY GVASIPVSAF YTRNPDHQVL
RFCFAKKQET LEKAVERLMK L