Gene Phep_3867 details

Gene Information

Locus tag: Phep_3867
Symbol:
ID: 8255001
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4641874
End bp: 4645224
Gene Length: 3351 bp
Protein Length: 1116 aa
Translation table: 11
GC content: 43%
IMG OID: 644937531
Product: TPR repeat-containing protein
Protein accession: YP_003094120
Protein GI: 255533748
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
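
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts one terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields above.
length = gene_length(4641874, 4645224)
print(length, protein_length(length))  # prints: 3351 1116
```

Under these assumptions the computed values match the listed Gene Length (3351 bp) and Protein Length (1116 aa).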


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.409704
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAAT TATTTGTAAA TGTTTGGGAA GAACAGGTCA GTATCCCTAC ATATCCAACG 
GGTAAGCCGG AAAAAAATCC AATGTTTTTT GAAAAGCGGA TTTATCAGGG CAGCAGTGGT
GTTGTTTATC CAAACCCGGT GATCGAAAAA ATCTATGATG ACAAAGAAGA TAAAAGTTAT
ACTGCCCTCT ACCTGGAAAA TAAATACCTG AAAATCATGA TCCTTCCTGA ATTAGGTGGA
AGGGTACATA TGGCCTATGA TAAGATAAAG CAGCGTCATT TTATTTATTA TAACCAAGTG
ATCAAACCTG CACTTGTTGG TCTTTGCGGG CCCTGGATAT CGGGCGGAAT AGAGTTTAAC
TGGCCTCAAC ATCATCGTCC GAGTACATTT GAGCCTGTTG ATTATGCCAT TGAAGAAAAT
GCGGATGGAA GTAAAACGGT ATGGGTTAAT GAGGTAGAGA AAATGTTCCT TACCAAGGGG
ATGGCAGGTT TTACCCTACA TCCTGATAAA GCCTACCTCG AAATTAAAGG GAAACTGTAT
AACCGGACCA GTCTGCCGCA AACATTTTTA TGGTGGGCAA ACCCTGCGGT AAAGGTAAAC
GACGATTATC AGTCGGTTTT TCCGCCTGAT GTAAACGCTG TTTTTGACCA TGGTAAACGT
GACGTTTCTA CCTTTCCAAT AGCAACGGGT ACTTATTATA AAGTTGATTA CGCCCCCGGA
ACAGATATCT CCAGATACAG GAATATTCCG GTACCTACTT CTTATATGGC CATCCGTTCC
GATTATGATT TTGTGGGTGG TTATGAAAAT GATACCGAAG CAGGGCTGTT ACATGTGGCC
AGCCACCACA TCTCGCCGGG TAAAAAACAG TGGACCTGGG GCAATAGTGA CTTTGGGAAA
GCCTGGGACA GGAACCTTAC TGATGAGGAC GGGCCCTACA TAGAGCTGAT GACCGGTGTT
TATACCGATA ACCAGCCCGA TTTCTCCTGG CTGATGCCTT ATGAAGAAAA ATCTTTTGTA
CAATATTTTA TGCCTTACCG CGAACTTGGG GTGGTAAAGA ATGCCAATAA GGATATTTTA
CTCAATTTAA CCCGGACCGG GAAAAAAGCC AGGCTGCAAA TTTTTGCAAC ATCTGTACAG
CGTGAAAATA AGGTTAGCTT GTCTGTTAAA GGTGAAGTGG TTTATTCAGT GGTATTTGAT
GTCAGCCCGG AAAGTACATT TGAAAAAGAA ATTGCCCTCC CAACCCAACT GGAGGAAACC
GATATTTTGC TGGTGATCAC AAATGCCTCA GGAAAAATCC TGCTGAAGTA CGAACCGGCT
GCCGATCAGA AAAATGAAAT TCCTGAGCCT GCAAAAGCAG CGCTGCCACC ACAGGAAGTA
GAAAATAACG AACAGCTTTT TCTTACAGGA CTGCACCTGG AACAATACCG GCATGCCACT
TATAATCCTG TTGATTATTA TGAAGAGGCC TTACGCCGCG ATGATAAGGA TGTGCGGGTG
AACAATGCGA TGGGAAAATG GTATTTGAGA AGGGGACAGT TAAATAAAAG TGAACCTTAT
CTGCGCAAAG CTGTTGAGAC CCTGGTTTCA CGTAACCCCA ATCCTTATGA CGGAGAACCT
TACTACAATC TGGGGCTTTG TTTAAAGCTT CAGGGCAAAT ACGATGAAGC TTACGATCAC
TTTTATAAAT CGGCATGGAA CAATGCATGG CAGGATAGTG CCTATTTTTC ACTGGCATTG
ATCGATGCCG CCCGTGGTGA TTTTGAACAG GCCCTGCAGC ACATTGAATG GTCGATAGAC
CGTAATGCAA GGAACAATAA GGCCCGGGCC TTAAAAGTGA TGGTGCTCAG AAAAATGGAC
CGGCTGGATG AAGCTGCCAT AGCTGCTGAG ACTGCTATAG AACGTGATCT TTTTAACCTG
AACATCTATT TTGAACAGTC TAAAATATAT AAGGGTAAAG GGCAGGAAGA AAAGGCTTTA
AAGGCATTAA ATGTACTGTT TAGATTGTCC AGACAAAATG TGCACAGCTA TATTACCTAC
GCGCTTGACT ACGCTGCGGC AAACCAATAC AGTGAAGCCA TAGAAATGAT GAGTTTTGCA
TTGGACCAAA GTGATACCGA TACCTATCCG ATGGTATTTT ATTACCTGGG CTGGTTTAAT
GTACAAATGG GCCAGGAGCA AAAGGCTTTA TCACTTTTTA AGAAAGCGGC CCTGGCCAAT
TCTGATTATT GTTTCCCCAA CCGGATAGAA GATGTAGCCG TGTTGCGTTC GGCAATGAGC
AGGAATGAAA ATGATGGCAA AGCACCTTAT TACCTTGGTA ATTTATGGTA TGATAAACGA
CAATACGATG AGGCCATTAA GATGTGGGAA TCTTCCATAG AGAGAGAGGA TACTTTTCCG
GCAACCTGGC GCAATTTGGG GATTGCCTAT TTTAACAAAA GAAATGATGC CGGTAAAGCC
CTGGAATGTT TTCAAAAGGC TTTTACACTT GATCAGACAG ATGCCCGTAT TTTTATGGAG
CTTGATCAGC TGTATAAAAG ATTGAATACC CCGGCAAGTG AACGTCTGAA TTTACTTGAA
GCACATCCTG CTCTGGTAGC ATTAAGAGAC GACGTTTACC TTGAACAGAT TGCTTTATAT
AATTTTACAG GTAGGTACGA AAAAGCCTAT GCAATGATCA TGTCCCGGCA GTTTCACCCT
TGGGAAGGCG GAGAGGGTAA GGTTTCGGGC CAGTACCTCT ACAGCCTGAC AGAGATGGCC
AAGCAACACA TACAGGATCA GGAATATCAG CTGGCTGTTG ACAAATTACA GATGGTACGG
GAATATCCGC ATAACCTGGG CGAAGGTAAG CTTTACGGTA TACAGGAAAA CGATATTTTT
TACTGGCTGG GTTGTGCATA CGATGGGTTA GGACAAAAAG ATATTGCCCG CGAACACTAT
CATATTGCCG TAAAGGGCCT GTCAGAACCA TCAGCGGCTA TGTTTTATAA TGATCAGCAA
CCCGACAAAA TTTTTTACCA GGGGTTGGCC TGGATGAGGC TCAATGCACC CGAAAAGGCA
AAAATGATCT TTGAAAACCT GATCAGCTAT GGAAGGGCAC ACTTAAATGA TGTTGTTAAA
ATAGATTATT TTGCGGTATC ACTGCCTGAT CTCCTTATTT TCGAAGATAA CCTGGATATA
CGGAACAAGA CGCACTGTCA TTACCTGATC GGTTTAGGTT TATTGGGCCT GAAGGAGCTG
AGCAATGCCA AAAAGGAATT TGAACTTGCT TTAAAAAATG ATGCTATGCA CTTTGGTGCA
AGCGTACACC TGCGGGCTGC AGGCTATAAA GATGAACCCG CGTTAATTTA A
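
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any computation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and only the first row of the sequence is used as sample input; to check the whole gene, paste in the full sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases in a (possibly blocked) nucleotide sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGAGCGAAT TATTTGTAAA TGTTTGGGAA GAACAGGTCA GTATCCCTAC ATATCCAACG"
print(len(clean_sequence(first_row)))   # 6 blocks of 10 bases
print(round(gc_percent(first_row), 1))  # GC percentage of this row only
```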
 
Protein sequence
MSELFVNVWE EQVSIPTYPT GKPEKNPMFF EKRIYQGSSG VVYPNPVIEK IYDDKEDKSY 
TALYLENKYL KIMILPELGG RVHMAYDKIK QRHFIYYNQV IKPALVGLCG PWISGGIEFN
WPQHHRPSTF EPVDYAIEEN ADGSKTVWVN EVEKMFLTKG MAGFTLHPDK AYLEIKGKLY
NRTSLPQTFL WWANPAVKVN DDYQSVFPPD VNAVFDHGKR DVSTFPIATG TYYKVDYAPG
TDISRYRNIP VPTSYMAIRS DYDFVGGYEN DTEAGLLHVA SHHISPGKKQ WTWGNSDFGK
AWDRNLTDED GPYIELMTGV YTDNQPDFSW LMPYEEKSFV QYFMPYRELG VVKNANKDIL
LNLTRTGKKA RLQIFATSVQ RENKVSLSVK GEVVYSVVFD VSPESTFEKE IALPTQLEET
DILLVITNAS GKILLKYEPA ADQKNEIPEP AKAALPPQEV ENNEQLFLTG LHLEQYRHAT
YNPVDYYEEA LRRDDKDVRV NNAMGKWYLR RGQLNKSEPY LRKAVETLVS RNPNPYDGEP
YYNLGLCLKL QGKYDEAYDH FYKSAWNNAW QDSAYFSLAL IDAARGDFEQ ALQHIEWSID
RNARNNKARA LKVMVLRKMD RLDEAAIAAE TAIERDLFNL NIYFEQSKIY KGKGQEEKAL
KALNVLFRLS RQNVHSYITY ALDYAAANQY SEAIEMMSFA LDQSDTDTYP MVFYYLGWFN
VQMGQEQKAL SLFKKAALAN SDYCFPNRIE DVAVLRSAMS RNENDGKAPY YLGNLWYDKR
QYDEAIKMWE SSIEREDTFP ATWRNLGIAY FNKRNDAGKA LECFQKAFTL DQTDARIFME
LDQLYKRLNT PASERLNLLE AHPALVALRD DVYLEQIALY NFTGRYEKAY AMIMSRQFHP
WEGGEGKVSG QYLYSLTEMA KQHIQDQEYQ LAVDKLQMVR EYPHNLGEGK LYGIQENDIF
YWLGCAYDGL GQKDIAREHY HIAVKGLSEP SAAMFYNDQQ PDKIFYQGLA WMRLNAPEKA
KMIFENLISY GRAHLNDVVK IDYFAVSLPD LLIFEDNLDI RNKTHCHYLI GLGLLGLKEL
SNAKKEFELA LKNDAMHFGA SVHLRAAGYK DEPALI
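
The protein sequence corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation follows, assuming the usual TCAG codon ordering; table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it permits, which this sketch does not model. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(blocked: str) -> str:
    """Translate a blocked DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(blocked.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks and the final 21 bases of the gene sequence above.
print(translate("ATGAGCGAAT TATTTGTAAA TGTTTGGGAA"))  # prints: MSELFVNVWE
print(translate("GATGAACCCG CGTTAATTTA A"))           # prints: DEPALI
```

Both sample outputs agree with the first block and the last six residues of the protein sequence.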