Gene Phep_4033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4033 
Symbol 
ID8255167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4876169 
End bp4878646 
Gene Length2478 bp 
Protein Length825 aa 
Translation table11 
GC content45% 
IMG OID644937697 
Producthypothetical protein 
Protein accessionYP_003094286 
Protein GI255533914 
COG category[S] Function unknown 
COG ID[COG4485] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.411624 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.743107 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAATT GGTTAAAAGC GAATGGCATT CATCTGGCCA TCATTGGATT TTTTATACTC 
ATCTGTTTTG TCTATTTCAG TCCGGTTTTA CAAGGTAAAG GTCCGCAGCA AAGCGACGTA
TTGCAAGCTA AGGCCACTGC CAAAGAGATC ATGGACTACA AGGAAAAGGA CGGTAAAGGG
CCTTTATGGA CCAACCAGAT GTTTGGTGGG ATGCCTGCTT ATCAGATCTG GGTACAGCAC
CCTTACAATG CGGCCACCTA TGTGATCGAT TTCATGAAGG CGGTGTTCCC GGATCCTGTA
GACGTAGTTC TGATGTACCT GCTTGGTGCT TACCTGCTGT TTTGTGTATT GAAGGTAAAT
CCCTTACTGG CAGCAGCGGG GGCTATTGCT TTTGCTTTTA CCTCTTATAA TTTCCAGATC
ATTGCTGCAG GACACAGCAA TAAGGCCCTG GCCATTTCTT TTCTGGCGCC TATAGTTGCC
GGGATCATTT TGACCATCAG GGGTAAATAC TGGCTGGGGG CCAGTTTAAC TGCTTTGTTC
CTGGCGCTGG AGATCAGGGC AAACCATATT CAGATGACCT ATTACCTGAT GATTGCCCTG
CTTATTTTTA TAGGCATAGA GCTTTATCAT GCCATAAAAG GAAAGAAGGT AGCGGCCTTT
GGAAAGTCGA TGGGCTTCCT GGCCATAGCA TTGGTACTGT CCCTGATGGT AAATGCGGGT
AAATTATGGA CGACTTACGA ATATGGTAAA GAATCGAACA GGGGCAAATC TAATTTAACA
ACGGACAGTG CCGAAGAAAA GAACGGGCTT TCCAAAGAAT ACGCTTATGG CTGGAGCCAG
GGTGTAGGTG AGAGCTTTAC TTTCCTAATC CCCAATTTAT ATGGGGGCGG GACAGGGATT
GATGAACTGG TGAAACCCGA GAGCAATACC TATAAAGCTT TACAGAATGT GACCGGTGGT
GATCCGAGGC CAGCCATACA GCAGCTGGCG GGACAGGTTG GCCTGCAGCA GTACTGGGGA
GAAAAACCCT TTACTTCGGG GCCTTATTAT TTTGGTGCCA TTGTTTGTTT CCTATTTGTA
TTTGGCTTGC TGATTGTAAG AAGCCGGTTA AAATGGTGGA TATTGGGTAC GACCATTTTG
TTTATGCTGC TTTCTTTTGG CAGGCATTTT CCATTGGTCT CGGATCTGTT TTTTGAATAT
TTCCCGATGT ACAATAAATT CAGGGCAGTA GAATCTATTC TGGCCCTGGT AGGTTTAATG
GTGCCTGTTT TGGCTTTTCT TGCTGTAAAA GAAACCCTGG AAGGGACTAT GGATCAGAAA
ACACTGGGTA AGAAACTGAC GGTTGCCGGT GCCATAACTG GTGGTTTTAC TTTACTGGTG
GCAGTAATGC CGAGTGCATT TTTCAGTTTT ACGGCATCAA ACCACCCGCA AATGGTGCAG
GTGCTGACCC AGATTGCCCA GAACAATGCC GGCGTGGCAC AAAGTATTGC CAATGCATTG
GTTCAGGATC GTATTGATAT TGCCCGTGCA GATGCCTTGC GCTCTTTATT GTTTATAGCA
ATAGGCTATG CGCTGATCTG GGCATTGATC AATAAAAAGA TGGGCTTGCA GACCGTAATG
GTTTTGCTGG GGCTTGCCGT ACTGATTGAC ATGTGGCAGG TAGACCGCCG TTATTTAAAC
AACAGCAATT TTGTGAGCAA GTCGGATCTG AAAAATCATT TTCAGCCGAG AGAGATCGAC
AACCTGATCC TGGCCGATAA AGACCCGGAT TACCGGGTAC TGGACCTGAG CATTGCTACT
TTTCAGGATG CGAGTGCTTC GGCCTTCCAT AAAACGATAG GTGGGTACCA TGCGGCTAAA
CTGAAACGTT ATCAGGAGCT GATCGATAAA CAGTTCTCGA AAAGCATTAA CCAGGATGTA
GTGGATATGC TGAATACCAA ATACATTATT ACGCAGGACC AGCAGACAGG TTCGTACAAA
ATGCAAAGGA ATGCAACCGC TGCAGGGCAT GCCTGGATTG TGTCTCATGT TCAATTTGCA
AAGGATGCAG ATGAAGAAAT GAAGGCCATT AACAGTTTTG ACCCTAAAAA GGAAGCCATT
GTTGATGTAA GGTATAAACA ACTGATCAAT GAGAAACGTC TTGGCTCTGG TGTCGGGGCA
ATGATTACAC TGGACAGTTA TCATCCGGAC CACCTGGTTT ATTCTTACAG TGCGCCAACT
GATGTAATTG CCGTATTTTC GGAGATTTAT TACGATAAGG GCTGGAACAT GTATGTAGAT
GGGGCTGAGA AGCCTTATTT CAGGGCAGAT TATGTTTTGC GGGCTGCCCA GCTGGAAGCA
GGCAACCATA AAATAGAGTT TAAGTTTGAG CCGGTTTCGT ATTATGCAGG TGAAAAGATC
TCTTTACTAG GCTCAGTTTT ATTGATTGCA GGATTGGGAT TTGCGTTTTA TTCGGAGAAA
AAGGGAAAAA AACAATGA
 
Protein sequence
MNNWLKANGI HLAIIGFFIL ICFVYFSPVL QGKGPQQSDV LQAKATAKEI MDYKEKDGKG 
PLWTNQMFGG MPAYQIWVQH PYNAATYVID FMKAVFPDPV DVVLMYLLGA YLLFCVLKVN
PLLAAAGAIA FAFTSYNFQI IAAGHSNKAL AISFLAPIVA GIILTIRGKY WLGASLTALF
LALEIRANHI QMTYYLMIAL LIFIGIELYH AIKGKKVAAF GKSMGFLAIA LVLSLMVNAG
KLWTTYEYGK ESNRGKSNLT TDSAEEKNGL SKEYAYGWSQ GVGESFTFLI PNLYGGGTGI
DELVKPESNT YKALQNVTGG DPRPAIQQLA GQVGLQQYWG EKPFTSGPYY FGAIVCFLFV
FGLLIVRSRL KWWILGTTIL FMLLSFGRHF PLVSDLFFEY FPMYNKFRAV ESILALVGLM
VPVLAFLAVK ETLEGTMDQK TLGKKLTVAG AITGGFTLLV AVMPSAFFSF TASNHPQMVQ
VLTQIAQNNA GVAQSIANAL VQDRIDIARA DALRSLLFIA IGYALIWALI NKKMGLQTVM
VLLGLAVLID MWQVDRRYLN NSNFVSKSDL KNHFQPREID NLILADKDPD YRVLDLSIAT
FQDASASAFH KTIGGYHAAK LKRYQELIDK QFSKSINQDV VDMLNTKYII TQDQQTGSYK
MQRNATAAGH AWIVSHVQFA KDADEEMKAI NSFDPKKEAI VDVRYKQLIN EKRLGSGVGA
MITLDSYHPD HLVYSYSAPT DVIAVFSEIY YDKGWNMYVD GAEKPYFRAD YVLRAAQLEA
GNHKIEFKFE PVSYYAGEKI SLLGSVLLIA GLGFAFYSEK KGKKQ