Gene Phep_1103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1103 
Symbol 
ID8252197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1290121 
End bp1292088 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content47% 
IMG OID644934754 
Producttype II and III secretion system protein 
Protein accessionYP_003091383 
Protein GI255531011 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1450] Type II secretory pathway, component PulD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00448794 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000241277 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGAAAT CGATTTTTAA ACCCTTATTC GTGCTGTTGT TTTTGTGTGT ATCGTTATTG 
GCACAGAAAC CGGTTTTTGG ACAGCAAAGT CAGCAGGAAC GGATGGAAAC CCTGGAGAAA
AAACTGGTGG GACTTTCGGC CTTTGTACCC GGCTTAAAGC AAAAGGTGCA GCTGGGGCTG
TCGGGTGCCT CCATTCAGGA GTTTTTAAGG GCGGTGGCCC AGGCCAATCA GCTCAACATC
AATGTAGACC CGATGCTGAA CATCAAGGTG TCGAACAATT TTACCAATGA GAATGCCTTG
AATGTGCTGC TTTTCCTGGC CAAAAATTAT AACCTGGAAG TGAACGTGAT CGGTTCTATC
ATTACGGTTG GGCCCTATAA CCCACCGGTG ATCAAAGCCC CTTATGTGCC AAAAGTAATT
GCCCTGAAAT ACAATGCGGC TGCAAATACC CTATCGATGG AACTCAGCAG CGACAGCCTG
GTGCTGGTGG CCCGCAAGAT CAGCCAGCTT TCAGGAAAAA ATATTGTGGT GCCGGTAAGT
TTAAACAGCA AACTGGTAAG CAGCTTTTTT AACGAGGCCC CATTGGAGAC GGCCCTGGAA
AAACTGGCAT TTGCCAATGA GCTGAAGCTG ACGAGGACCA GCGACAATGT ATATGTTTTT
CAGGCCCTGG AGGAAGGGGA AGAAGTTTTT ATCAATGCAG ATAAGGACAC TGACATCAGG
AAGAACCTGA AACCAGCCGC CGGGGCCGGG GCTGCTAAGG GCACGTTTAA TGTTTCAGGA
AAAAGCGATA AAAACATTAA AAGTGGTAAG CTGCTGAACA TTGATGCCAT CAATACCCCG
ATCATAGACC TGGTGAAATC GGTCGCTAAC GAGACGAATA TCAACTTTTT CCTGTATTCG
GAACTGAAAG GGAACATCAG CACCAAGGTG AACAACATTA CCTTCGATAA TTTTTTGACG
GCGATGTTTA ACGGAACGGA TTATACCTAT AAAGTGGATA ATGGAGTATA CCTGATTGGC
GACAGGAAAC TGGAGGGGCT GCGCAAGAAC AAAGTACTGC AATTGCAGTA CCGGTCTGTA
GATACGATCA TGAACATGAT CCCTATGGAA TGGAAAAAGG GGGTGGACAT TAAGGAGTTT
AAGGAGCAGA ACACCTTGCT GCTTTCCGGT TCGGCCCCGC AACTTGCGGA AATTGAGACC
TACATCAGGG AGCTGGATAA ACTGGTGCCG ATGGTGCTGA TTGAAGTGAC CCTGCTGGAT
ATCAGGAAAG GGCATACCAC CAAGACCGGC ATAGAAGCCG GACTGGCAGA TTCGGTGGTG
CGGACCAGGG GTACTGTGCT GTCGGGTGTG GACATGACCC TGGGTGCAGG TTCTATCAAT
AACCTGCTCG GAAAAATTGG CTCCAACAGT GTATTTAATA TCGGAAAGGT AACCCCCAAC
TTTTACGTGA AACTAAATGC ACTGGAAGCC AACAACAATA TCGAGATCAG GCAGGTGCCC
AAACTGGCCA CCCTGAATGG TCATACGGCC AATTTAAGTA TAGGCACCAC CCGGTATTAC
GTGACCAAAA CGCAGAACGT GTTTTCATCG GTTAATACCC AAACGGTATT TACCGAACAG
TTTAACAAGG TGGATGCCAA CCTGGCCATT GGCATTAGTC CCGTGGTTTC GGGTGATGAC
CAGGTGACCC TAAAAATTAA GGTAGAGATA TCTGACTTTA TTGGTACGCC GCCCGCAAAT
GCACCGCCAC CGACATCGAC CAGTAAGTTT GAGTCGATTA TCCGGGCACA TAACGAGGAT
ATGATTGTAC TGGGTGGTAT GGAAAGGTCG GAAAAATCGG ATAGTGGAAG TGGCGTACCC
CTGTTGTCGC GTATACCGGT CCTGAAATGG ATATTTAGCA GCAGGGAGCA AACGAAATCG
AAGGTAGTAA CGCTGGTCTT TATTAAACCG ACCATAATTT ATCAATAG
 
Protein sequence
MKKSIFKPLF VLLFLCVSLL AQKPVFGQQS QQERMETLEK KLVGLSAFVP GLKQKVQLGL 
SGASIQEFLR AVAQANQLNI NVDPMLNIKV SNNFTNENAL NVLLFLAKNY NLEVNVIGSI
ITVGPYNPPV IKAPYVPKVI ALKYNAAANT LSMELSSDSL VLVARKISQL SGKNIVVPVS
LNSKLVSSFF NEAPLETALE KLAFANELKL TRTSDNVYVF QALEEGEEVF INADKDTDIR
KNLKPAAGAG AAKGTFNVSG KSDKNIKSGK LLNIDAINTP IIDLVKSVAN ETNINFFLYS
ELKGNISTKV NNITFDNFLT AMFNGTDYTY KVDNGVYLIG DRKLEGLRKN KVLQLQYRSV
DTIMNMIPME WKKGVDIKEF KEQNTLLLSG SAPQLAEIET YIRELDKLVP MVLIEVTLLD
IRKGHTTKTG IEAGLADSVV RTRGTVLSGV DMTLGAGSIN NLLGKIGSNS VFNIGKVTPN
FYVKLNALEA NNNIEIRQVP KLATLNGHTA NLSIGTTRYY VTKTQNVFSS VNTQTVFTEQ
FNKVDANLAI GISPVVSGDD QVTLKIKVEI SDFIGTPPAN APPPTSTSKF ESIIRAHNED
MIVLGGMERS EKSDSGSGVP LLSRIPVLKW IFSSREQTKS KVVTLVFIKP TIIYQ