Gene Phep_1867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1867 
Symbol 
ID8252971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2156917 
End bp2157867 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content41% 
IMG OID644935518 
ProductMammalian cell entry related domain protein 
Protein accessionYP_003092137 
Protein GI255531765 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID[TIGR00996] virulence factor Mce family protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.449047 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATAG CAAACGAAAC TAAAGTAGGC ATCCTGGCGG CCTTTTCAAT TACCTTATTA 
ATTATTGGTT ATAACTTTTT AAAGGGTAAT GCCATTTTTT CGAATGAGAC CGTGCTGTAC
GCCAGATACC CTAGGGTTGA TGGACTGGGC GTGTCCAAAC CAGTTCTGAT CAATGGTTTT
CAGATTGGCC GCGTAGATAA GCTACAATTA CAATCAGATG GGAGTATCCT GGCCACTTTA
AAGATCAAGG GAAAATATGA AATCCCAAAA AACAGTATAG CTAAACTGGA AGGTACCGAC
CTGCTGGGCA GTAAAGCTAT TGTAATGGAA CTGGGCACGG GTCAGGATTT TGCGCAGGAT
GGGGATACCC TTAATGCAAA TGTGGCTAAG GGTCTGCTTG AAACCGTACA GCCGGTTCAG
AAAAAAGCTG AACTGATCAT CACTAAAATG GATTCGATCC TGACAAGCGT AAACTCTATC
CTAAACCCTA ACTTTCAAAA GAATGTTGAT AAAAGTTTTA ACAGCATAGC TTCTACGCTT
TCTTCATTGG AAGCTACATC TAAAAAGGTA GATAATCTGG TGGGCTCTGA AGGGTCCAGG
GTATCTGCAA TCCTGGCCAA TGTAGAGGCC ATTTCAAGTA ACCTGAAAAA GAACAATGAA
AAGATAAACG GCATATTGAA TAACATTGGC AATATCACAG ATCAGGTGGC TGCAGCTAAT
TTTAAACAGA CCATAGAGAA TGCCAATAAG GCCATGGCCG ATCTGCAGAC CATTGTTAAT
AAGGTGAACA ACGGACAAGG AACCCTGGGT ATGCTGGTGA ATGATACAAA AATGTATGAA
AACCTGAACA ATGCCTCTAA AAACCTGGAT AACCTGATGA TAGACCTGAA ACAAAATCCT
AAACGTTACG TTCACTTCTC CGTATTCGGA GGTGGTAAAA AGGATAACTA A
 
Protein sequence
MKIANETKVG ILAAFSITLL IIGYNFLKGN AIFSNETVLY ARYPRVDGLG VSKPVLINGF 
QIGRVDKLQL QSDGSILATL KIKGKYEIPK NSIAKLEGTD LLGSKAIVME LGTGQDFAQD
GDTLNANVAK GLLETVQPVQ KKAELIITKM DSILTSVNSI LNPNFQKNVD KSFNSIASTL
SSLEATSKKV DNLVGSEGSR VSAILANVEA ISSNLKKNNE KINGILNNIG NITDQVAAAN
FKQTIENANK AMADLQTIVN KVNNGQGTLG MLVNDTKMYE NLNNASKNLD NLMIDLKQNP
KRYVHFSVFG GGKKDN