Gene Phep_4083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4083 
Symbol 
ID8255217 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4924101 
End bp4925816 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content41% 
IMG OID644937747 
ProductTetratricopeptide domain protein 
Protein accessionYP_003094336 
Protein GI255533964 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.446531 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATGA CAAAGAAAGC AATAACCTTA GGTTTAGGTT TAGTAGTGAT GGGTTCTGCC 
TCTTTTGCTC AGAGCTTAAG TGACGCAAAA AAGGCGATAG ATGCCGAGCA GTATCAGAAA
GCGACTTCGA TGCTTAAAAC ATTGGTGAGT TCACAGGCCA GCAAAGGCGA GAACTACTAT
AATTTAGGCG AGGTTTACCT GAAGATGGAT TATGTGGATT CGGCACGTGC AGTATTTACC
AAAGGTGTTA CAGCTGACCC AAAGAATTCA TTAAATTATA TTGGTCTGGG AGAGGCTGAT
CTGGTTTCCA ATAACCCGAC ATCAGCAAAA ACCAATTTTG CAAAGGCAGT GGAGGTTTCT
TCGAAAAAAG ATTGGATTCC TCAGTTGTAT ATAGGAAAAG CCTATATTGC TACTGATAAG
CCTGATTTTG AAGCCGCTTT GCCTTATTTA CAAAAAGCAG AAGAATTGGA TGCAAATGAT
AAAGATGCTG AAACTTTTAT TGCGCTAGGT GATTATTATG CCTTGCAAAA GAAAAATTCT
GAGGCTTTAC AAAACTACAT GAGAGCGGGA AATATCAATG AGAGTATTTT AAGGGCCCCT
GTTCAGATTG GAAGAATGTA CAAAGAATCA AGAGCATTTA CCGAGTCTGA GCAGCAATTA
AAAGAGGTGA TCGCTAAAGA TGCCAATTAT GGACCTGCTT ACAGGGAGAT CGCTGAACTT
TATATGCAGT GGGCTATCCA GATGCCAAAC GAATTTGCAG CTAAATCTGC CCAGGCCCTG
GAAAACTATA AAAAATATCT TGACCTGACC GACAAATCTT ATGAATCAAG ATTGCGTTAT
GCCCAGTTTC TGTTCTATGC AAAGGATTTT AAAACCCTTG AGCAGGAATC TGCAACTCTG
CAGCAAATGA ACCCCAATGA CCCTAAAAAT CTTGTGGTAA CAAGGTTGCG TGGTTATTCG
GCATTCGAGA ACAAGAACTT CCCTCAAAGT TTGGAATACA TGAACGATTT CTTTGCAAAA
GCCAAAGATA CAAGCCGTAT TGTAGCTTCA GATTACTTAT ATCTGGGAAG GGCTCAGCTG
GCAGCCGGAA ACGATAGCCT GGCCTTGATC AACATTACCA AAGCAGTTGA GAAGGATTCT
ACGAATGTTG AAGCATTGGC TGATGTGGCA AAAGCTTATT TTGATGCTAA AAAATATGCG
AAATCTGCAC AGGTGTACGA CAAGGTGATC AAAGCTGGTC CGAATGCAAA AGGCGTATTG
TACAGCTATT TTTATGATGG CCTGGCTTAT TATTTCGATT ACGCCAATCA GTACAGTGCC
AAATTGAACC CTTCAAAAGA CCTGTTGGTT AAAGCTGATT CTGCTATTGC AAAAGTAGCG
CAGCTTGCCC CTGAAACCAC GGATGCTTAT CTGTACAGAG GACGGATCAA TAACCTGCTT
GACGATGAAG CCAATCCTAA GGGACTATTG GTGCCTCATT ATGAAATGTT CATCAAAAAA
GTAACCGTTG ACAAGCCTGA TCTTGCTCCT GCAAATGCCA GAAAATTATC TGAAGCATAT
GATAACCTGG CAGGTTTTTA TGCGATAAGC GATAAGGCAA AAGCTATTGA CTACCTGAAT
AAAAGTATTG CGGCTAACCC GGCTGGTACT TTTGCTCCGG CTAAACTGAA AGAGTTAACC
GCTCCTGCAG CTAAGGCCCC GGTAAAGAAA AAGTAA
 
Protein sequence
MKMTKKAITL GLGLVVMGSA SFAQSLSDAK KAIDAEQYQK ATSMLKTLVS SQASKGENYY 
NLGEVYLKMD YVDSARAVFT KGVTADPKNS LNYIGLGEAD LVSNNPTSAK TNFAKAVEVS
SKKDWIPQLY IGKAYIATDK PDFEAALPYL QKAEELDAND KDAETFIALG DYYALQKKNS
EALQNYMRAG NINESILRAP VQIGRMYKES RAFTESEQQL KEVIAKDANY GPAYREIAEL
YMQWAIQMPN EFAAKSAQAL ENYKKYLDLT DKSYESRLRY AQFLFYAKDF KTLEQESATL
QQMNPNDPKN LVVTRLRGYS AFENKNFPQS LEYMNDFFAK AKDTSRIVAS DYLYLGRAQL
AAGNDSLALI NITKAVEKDS TNVEALADVA KAYFDAKKYA KSAQVYDKVI KAGPNAKGVL
YSYFYDGLAY YFDYANQYSA KLNPSKDLLV KADSAIAKVA QLAPETTDAY LYRGRINNLL
DDEANPKGLL VPHYEMFIKK VTVDKPDLAP ANARKLSEAY DNLAGFYAIS DKAKAIDYLN
KSIAANPAGT FAPAKLKELT APAAKAPVKK K