Gene Phep_4200 details

Gene Information

Locus tag: Phep_4200
Symbol:
ID: 8255336
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 5077730
End bp: 5079046
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 51%
IMG OID: 644937866
Product: hypothetical protein
Protein accession: YP_003094453
Protein GI: 255534081
COG category:
COG ID:
TIGRFAM ID:
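
The coordinates and lengths listed above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but adds no residue to the protein.
    return codons - 1 if has_stop_codon else codons


length = gene_length(5077730, 5079046)
print(length)                  # 1317
print(protein_length(length))  # 438
```

Both values agree with the Gene Length and Protein Length fields of this record.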


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0156411
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.730059
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAACA TGATCAAATT ACCGGCAATT GTACTTGGGC TGCTGCTCAT GTTCACCGGT 
GCAGCACAAA AGGTTATGGC GCAGGGAGAT GATGTTTCGC TCCAATCGTT TTATGATGAA
CTGTCGCCTT ATGGTACCTG GATACAGGAT CCGCAATACG GGTACGTTTG GAGACCTGAT
GTAGAACAAG GCGATTTCAG GCCTTATTAT ACCAATGGCC GCTGGGCAAT GACCGAGTAC
GGTAACACCT GGGTTTCCAA TTACGACTGG GGCTGGGCGC CTTTTCACTA TGGAAGATGG
GTTTACAACC GTTACGGTCA ATGGATATGG ATTCCTGATA CCACATGGGG ACCTGCATGG
GTAAGCTGGA GAAGTGGTGG TGGCTATTAT GGCTGGGCGC CGATGGGCCC TGGAATGAAC
ATCAATATCA ACCTTAACAT CCCCGATCTG TGGTGGGTAT TTATCCCCCA AAGGAATATC
TATTACGATA GCTTTCCACG ATATTATTCC CGCAGAAATG TGACCATCAT CCATAACACG
ACCATCATTA ACAATACTTA CGTAAATAAC CGCCGTACTT ATTACACTGG CCCAAGGGCC
GATGACATCA GACGTGCAAC CGGAAGAGAT GTAAGAGTAT ATAATGTAAA TACCACAGGC
AGGCCAAGCC GCAGTAACAT AAATGGCAAC AGTGTTGACA TCTATACGCC AAGGCCAAGC
AGGGGTAGTT CCAATGTAAA TGCCAAACCA CGGGAAGCCA TCAGAGGAGA AGGGTATACC
ACACCAAGAG GAGACCGCGG AACGGCAAGC AACGGATCAT CGTCCGGGCG CCCTTCCAGA
ATTGACAACC AGGGTAACAG ACCTGATAAC CGTGAAAACG GCGTAACTAC ACCTCAAAAC
AGGGGCGAAA GACCTATTTA TGAAAATAAT GGCAGGCCAT CAAGAACAGG AAGCCCGGAA
AACAACGGCT CAACAAGACC ACAGCGAATA GAAAGACAAA ATCCTTCAGG AGAAACTCCT
GCACAAAGGC CACAGGAAGT TCAGCCTGCC CCACAACAAC GTCAGGAGCG TCAGGAGCGT
CCTCAACCAC AGGCACGACC TCAGCGTCAG GAAAACAGGC CGGAAGCCCC GCGTCAGCAG
GAAAGACAGC AACAACCACA ACGTCAGGAA AGCAGACCGC AGCAGCCACA GGCCCAGCCT
CAGTACCAAA GACCGGAACG GACCCAACAA AGTGCACCTC CGGCCAGAAG TTCTGAAAGC
CGCGGCGAAC AAGGTGGCAG AGGAGCTGAA CGACCAAGCC GCGGAGGCAG GAGTTAA
 
Protein sequence
MKNMIKLPAI VLGLLLMFTG AAQKVMAQGD DVSLQSFYDE LSPYGTWIQD PQYGYVWRPD 
VEQGDFRPYY TNGRWAMTEY GNTWVSNYDW GWAPFHYGRW VYNRYGQWIW IPDTTWGPAW
VSWRSGGGYY GWAPMGPGMN ININLNIPDL WWVFIPQRNI YYDSFPRYYS RRNVTIIHNT
TIINNTYVNN RRTYYTGPRA DDIRRATGRD VRVYNVNTTG RPSRSNINGN SVDIYTPRPS
RGSSNVNAKP REAIRGEGYT TPRGDRGTAS NGSSSGRPSR IDNQGNRPDN RENGVTTPQN
RGERPIYENN GRPSRTGSPE NNGSTRPQRI ERQNPSGETP AQRPQEVQPA PQQRQERQER
PQPQARPQRQ ENRPEAPRQQ ERQQQPQRQE SRPQQPQAQP QYQRPERTQQ SAPPARSSES
RGEQGGRGAE RPSRGGRS
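
The relationship between the gene sequence and the protein sequence can be reproduced with a short Python sketch. It is one possible way to do this, not the method used to generate the record. It assumes the sequence is pasted as shown, in space-separated blocks of 10 bases, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_content` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 21 bases of the gene sequence in this record.
print(translate("ATGAAAAACA TGATCAAATT A"))  # MKNMIKL
```

Passing the full gene sequence to `translate` should give the protein sequence listed above, and `gc_content` can be compared against the GC content field.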