Gene Phep_2202 details


Gene Information

Locus tag: Phep_2202
Symbol:
ID: 8253308
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2533061
End bp: 2534422
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 44%
IMG OID: 644935851
Product: protein of unknown function DUF1080
Protein accession: YP_003092468
Protein GI: 255532096
COG category:
COG ID:
TIGRFAM ID:
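
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (both assumptions are consistent with the values in this record, but neither is stated by it). The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2533061
END_BP = 2534422


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Length in aa, assuming the last codon is a stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("gene length is not a multiple of 3")
    return codons - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace in the input is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1362
print(protein_length(length_bp))  # 453
```

Passing the full gene sequence from the Sequence section to `gc_percent` would give the unrounded GC percentage for comparison with the reported value.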


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.262021
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACCT CACAATTATT AATTTTAACA GGTTTGCTGT TTGGCAGCAC CGCAGGTACT 
TTACAGGCAC AAGGCGGGAA ATGGCAAAAC CTTTTTAACG GCAAAGACCT GAAGGGCTGG
AAACAGTTAA ATGGAAAGGC AAAATACGAA GTTGTAAACG GGGAAATCGT AGGTACTACG
GTACGCGATA CGCCCAATTC TTTCCTGGCT ACAGAGAAGA ACTATGGTGA TTTTATTTTT
GAAGTAGAAT TGCTGGTAGA CAATTCCATG AACTCGGGCA TACAGTTCAG GAGTGAAAGT
AAGGCCGACT ATAAAGAAGG CAGGGTGCAT GGTTATCAGA TGGAGGTAGA TCCTTCTGAC
AGGGCCTACA GTGGCGGCAT TTATGATGAG GCCCGTCGTG GCTGGCTGTA TCCGATGGAC
ATTAACCCTG CAGGTAAAAC AGCTTTTAAA AAGGGGGAAT GGAACAAATA CCACATCGAA
TGCATTGGCA ATTCGATCAG GACCTGGGTA AATGGTGTGC CTACGGCTAA TGTAGTTGAC
GATATGACCT CCTCAGGTTT TATTGCCCTG CAGGTACATG CTATTGGTAA AAATGATGAG
CCGGGTAAAC AGATCAGGTG GAGGAACATC CGTATCCAGA CAGAAAACCT GAAGCCTGCA
AAAGCAGACC ATATATTTGT GGTGAACATG ATCCCCAATA ACCTGTCGCC TGCCGAAAAA
GTAAACGGAT ACAGCCTGTT ATGGGATGGT AAGACCAGTA ATGGCTGGAA AGGTGCCTAT
AAGCCTGGTT TTCCTGAAAA GGGATGGGAG ATTAAAGATG GGGTGCTGAG TGTACTGAAG
TCCAACGGTG CGGAATCGAC CAATGGCGGT GACATTGTAA CGGTTAAACA ATACGGTGCT
TTTGAAATGA AGTTTGATTT TAAACTTACT GAAGGTGCAA ATAGCGGGGT TAAGTATTTT
GTTACCCTTA CTGAAGGCAA TAAAGGTTCG GCGATTGGGC TGGAGTATCA GATACTGGAT
GATGAGAGAC ATCCGGATGC CAAACTGGGC AAAAACGGTA ACCGTAAACT GGGTTCTTTG
TATGACCTGA TCACCAGCAA AAAAATACCC AATGCACAAA GGAAAATCGG CGAATGGAAC
AAAGGGGTAA TTAAGGTATA TCCCAACAAT AAGGTTGAAT ATTATTTAAA CGGATTTAAG
ATCCTTGAAT ATGTACGGGG ATCGGCCGAG TTTGAGGCAT TGGTTGCAGA AAGCAAATAT
AAGAACTGGA AAAATTTTGG TATGGCGCCT AAAGGCCATA TCCTGCTCCA GGACCATGGC
GACAGTGTAT CCTTCAGAAG TATTAAATTA AAAGAACTAT AA
 
Protein sequence
MKTSQLLILT GLLFGSTAGT LQAQGGKWQN LFNGKDLKGW KQLNGKAKYE VVNGEIVGTT 
VRDTPNSFLA TEKNYGDFIF EVELLVDNSM NSGIQFRSES KADYKEGRVH GYQMEVDPSD
RAYSGGIYDE ARRGWLYPMD INPAGKTAFK KGEWNKYHIE CIGNSIRTWV NGVPTANVVD
DMTSSGFIAL QVHAIGKNDE PGKQIRWRNI RIQTENLKPA KADHIFVVNM IPNNLSPAEK
VNGYSLLWDG KTSNGWKGAY KPGFPEKGWE IKDGVLSVLK SNGAESTNGG DIVTVKQYGA
FEMKFDFKLT EGANSGVKYF VTLTEGNKGS AIGLEYQILD DERHPDAKLG KNGNRKLGSL
YDLITSKKIP NAQRKIGEWN KGVIKVYPNN KVEYYLNGFK ILEYVRGSAE FEALVAESKY
KNWKNFGMAP KGHILLQDHG DSVSFRSIKL KEL
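
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python, assuming the gene sequence is given in the coding direction, as it appears to be here. It builds the codon table from the compact NCBI-style amino acid string; translation table 11 uses the same codon-to-amino-acid assignments as the standard code for internal codons, so the differing start-codon rules are not modelled. The helper names are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(sequence: str) -> str:
    """Remove the spacing used in the 10-letter display blocks."""
    return "".join(sequence.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1; a stop codon is written as '*'."""
    dna = clean(dna)
    if len(dna) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First and last rows of the gene sequence above. Each full row holds 60
# bases, so every row starts on a codon boundary.
first_row = "ATGAAAACCT CACAATTATT AATTTTAACA GGTTTGCTGT TTGGCAGCAC CGCAGGTACT"
last_row = "GACAGTGTAT CCTTCAGAAG TATTAAATTA AAAGAACTAT AA"

print(translate(first_row))  # MKTSQLLILTGLLFGSTAGT
print(translate(last_row))   # DSVSFRSIKLKEL*
```

The two printed fragments match the start and end of the protein sequence above, with the trailing `*` marking the TAA stop codon.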