Gene Phep_4044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4044 
Symbol 
ID8255178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4887408 
End bp4888949 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content46% 
IMG OID644937708 
ProductRhomboid family protein 
Protein accessionYP_003094297 
Protein GI255533925 
COG category[R] General function prediction only 
COG ID[COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.887377 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATTTC AGTGGGGCTA TTCTCCAAAA ATTGAAAAAT ACATCCCATT GGGTAATTTT 
CCGGCCGACA GGTACCTGGT TATTGCACAA CAGGCGATAG AGAACCTGGG CTGGAAACTA
AGCCATTTGT CGGAATCGGG GATTATTGCC TATACAGGTC TTTCGGCACA ATCGTACAGT
GAAGAGATTT CTATCCGTAT CATTGCTAAT TTTGCTGTTT TTAAAAGCGA ATGTATCGGC
ATTCAGCTGC TGTTTACCGA TTATGGAAAG AACCAGGAGA ACCTTGACCG GTTTTTTCAT
GAATTTGAAT ATGTGGAGTA CCACCTGGCC GCGGTTTGGG AGGACAGGCT GAAGGATTTT
CATGATCATA TTGCTAAGCA GGACGACAGC TATTTTAACC GGGCACCTTT AAAAGCAAAA
GATAAGATTA AAAATATATT TTACCTGTTC ATTCCGCAGA AGGGTTATCT GGTTACCCCC
ATTATTGTAA CCCTGAACAT TTTGCTGTGG CTGGGCAGGC TGGCCATAGT GCTGCTGCTG
TCGAACCTTT TTAAGGCGCA GCTGGGGGGG CAGGGTTTAC TTACGCCAGA ACTTTTATTG
AAGATACAGT CTTATTTCGG GATAAACAGC AGGGACCTGA CCCTGAGTGG GCAATGGTGG
CGATTGCTGA GCAGTCAGTT TTACCATTTC TCGCTGCTGC ACCTGTTTTT CAATATGTAT
GCGCTGATCT ATATCGGGCT GATGACAGAG AACAAGCTGG GCTGGGCAAA AACACTGATA
GTTTATATTT TAAGTGGAAC ATGTGGGGCT TTGCTAAGTG TTTACGGGCA TAAGATCGGA
TTTATGGGTG GGGCTTCGGG GGCCATTATG GGCATGTTTG GTGCTTTTCT GGCGCTGCTG
CTGAGCAATG CTTTTGAAAA GACCGCTGCC AGGGCCTTAC TGATCAGTAC GGTAATTGTG
GTAGCATACA TGCTACTGAA TGGCCTGCTG AGTGAGACTG CCGACAATTC GGCGCATCTG
GGCGGATTGG TTTCAGGCTT CCTTATTGGT TATCTTCTAT ATAATGAACG TTTGCTGGGC
CAGCCTGTGC CGCTGTTGTA CAGGGCTTCG GCTTCGGGTT TTATGGTGCT GGCCTTTGCT
GCACTCATTT ATCAGTTTTC GCCCCGATAC CAGGTAGAGG AATATGCAAA ACTAAGGGAT
GCCTTTAATT TAAATGACGA AAGGTTTAAC CAGATCTATT ACATCAGCAG TGATTTGCCT
TTGGAAGAAA AACTGAGGAG GGTGAAGCTG AACGGGATAG ATGTTTGGGC CGAGAACCTG
AAGATCACCA GGGAAATGGA CAAGCTGATC GTATTGGAAA TGGATGGGAT AGACCGGGAT
TACCGTAAGG TAATTGCGGA AAAAGCCTAT GCGGTGTCGA TGCTGATGTA TAAAGACTAT
GAGTCGGGTA CACGGGAAAA CAGGGCAGTT ATCCAGGAAA AGATCAATGA GGTGATGCGG
CTGAAGGCGA AACTGAGAGA ACGCCTGACG GAGCCTGAAT AA
 
Protein sequence
MAFQWGYSPK IEKYIPLGNF PADRYLVIAQ QAIENLGWKL SHLSESGIIA YTGLSAQSYS 
EEISIRIIAN FAVFKSECIG IQLLFTDYGK NQENLDRFFH EFEYVEYHLA AVWEDRLKDF
HDHIAKQDDS YFNRAPLKAK DKIKNIFYLF IPQKGYLVTP IIVTLNILLW LGRLAIVLLL
SNLFKAQLGG QGLLTPELLL KIQSYFGINS RDLTLSGQWW RLLSSQFYHF SLLHLFFNMY
ALIYIGLMTE NKLGWAKTLI VYILSGTCGA LLSVYGHKIG FMGGASGAIM GMFGAFLALL
LSNAFEKTAA RALLISTVIV VAYMLLNGLL SETADNSAHL GGLVSGFLIG YLLYNERLLG
QPVPLLYRAS ASGFMVLAFA ALIYQFSPRY QVEEYAKLRD AFNLNDERFN QIYYISSDLP
LEEKLRRVKL NGIDVWAENL KITREMDKLI VLEMDGIDRD YRKVIAEKAY AVSMLMYKDY
ESGTRENRAV IQEKINEVMR LKAKLRERLT EPE