Gene Phep_4230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4230 
Symbol 
ID8255366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp5107215 
End bp5108405 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content42% 
IMG OID644937896 
Producthypothetical protein 
Protein accessionYP_003094483 
Protein GI255534111 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000155249 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCTT ATATTTTAGC TATTGGGCTG ATGATGGCAA TAAATTTTGC CATTGCGGGC 
ACCAAAGATC CGAATGATCC TGTTCGTTCG AAAACCTTTT CAAAAAGTTT TCCATTAGAT
GGGAATGATA AGGTAAACCT GAACAACAGG TACGGGGAGA TGCTGATCAA AACCTGGGAT
AAAAAAGAAA TACGGGTAGA CATTGACATT AAAGCTTACA GCAAGGACGA GGCTGATGCG
CAGCGTTTGG TAGATGGAAC AGTTATAACT GCAGATAAAA GCGGCGACCA GGTTTCATTT
AAAACCAATT TTGCAACAGA GGGTGGTAAA TCAGGTGCCA GTTTCAGGAA CGGAAAAACG
ATATCCAGAA GGGAAATCAG GGTAAATTAC GTCGTATATA TGCCTGCGAC CAACGCTTTA
ACCTTAAGTA ATCAATATGG AAATGTAAAT ATAGGCAGTT TTTCCGGCGC ATTGATCGCT
AAGGTTCAAT ATGGGAGTTT AACGGTGGGA AATCTGAAAA ACGCCAGCAA TATCCTGGAA
ATTCAATATG GCTTTACTAA AATTCAGGAA ATAAATGCAG CTACGATTAA ACAGCAATAC
GGGGCGGGGC TTAACATTGG TACCATTGGC AGCTTGAATC TGGAGGCACA GTATGCGGGC
GTAACCATTG GCACCATTAA ATGGGATGCT GTGGTTAAAC AGCAATACGG ACCGGGACTG
GACATTGGCA GGGTAAGTAA CCTGGACCTG AATGTACAAT ATGCCAATGT AAAACTGGGA
ACGGTTATCG GGGATGCGAA TATCAAACAG CAATACAACA AACTCTCGAT CGGGTCTGTA
AACACATTAA ACCTAAAGAG CCAGTACACA ACAGTTGCCA TTGGGAATTT AAATGGCCCG
GGTAATTTTG GTGTTGCCTA TGGCAAACTG ACTGTCGAGC AAATAGGCTC AGGGTGTAAA
AATTTAAACC TGTTGAGCAG TTATTCACAT ACTTCCTTAA AGTTCAGCGA CAATTACCAG
GGGAACTTTG AATTAAGGAC CAGCTATTCG CCATTCAAAG CAGGTGCAGG AGTAAGCTCG
AAACTGGTAG CCGAAAAAGG GAACATTAAA AATTATGCAG GTACCATAGG AAACGGTGGC
GGAGCCCAAA TTATGCTCAA GGCCGATTAT GGTTCGTTGA ATTTAAACTA G
 
Protein sequence
MKSYILAIGL MMAINFAIAG TKDPNDPVRS KTFSKSFPLD GNDKVNLNNR YGEMLIKTWD 
KKEIRVDIDI KAYSKDEADA QRLVDGTVIT ADKSGDQVSF KTNFATEGGK SGASFRNGKT
ISRREIRVNY VVYMPATNAL TLSNQYGNVN IGSFSGALIA KVQYGSLTVG NLKNASNILE
IQYGFTKIQE INAATIKQQY GAGLNIGTIG SLNLEAQYAG VTIGTIKWDA VVKQQYGPGL
DIGRVSNLDL NVQYANVKLG TVIGDANIKQ QYNKLSIGSV NTLNLKSQYT TVAIGNLNGP
GNFGVAYGKL TVEQIGSGCK NLNLLSSYSH TSLKFSDNYQ GNFELRTSYS PFKAGAGVSS
KLVAEKGNIK NYAGTIGNGG GAQIMLKADY GSLNLN