Gene Phep_3404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3404 
Symbol 
ID8254523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4051749 
End bp4052984 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content45% 
IMG OID644937056 
Productdomain of unknown function DUF1735 
Protein accessionYP_003093660 
Protein GI255533288 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00777972 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAGCTA CTCATAAAAC CAACAACATG AAAAAAATTA AATTAGCTGC GTCGATAGGC 
ATTTTAATGA TGCTGGCAGC ATGTAGTAAA GACGTCAAAG TTGATCTGGA TCTTCCTAAA
GCAGAGCAGT TGAAAAAGGT ATATATGCCG CAAGCTGCCA ACCCGGTTGA AGTCCGTTCT
GTAAGTATTG TCGATAAGGA CACCTCCTTT GTATATAGTG CATTTCTTGG GGGCCCCAAA
CCAGCAACCG GTAACATTAC AGTAGGTTTT ACTGTCATGC CGGAAAAAGT TACGGCTTAC
AATCAGCAAA ACAGTACCAA TTATCAGCTT ATGCCTCAGG GTAGTTATGT GCTCGAGGCC
CTTACTTCCG TTATTCCGGC TGGTGGGCAG TCAACAGGCA GGTTAAACCT GTCGATCAAA
ACAAAAGGCT TTTTAAATCC TTTTGAAACT TATCTCTTGC CTGTTACCGT GACTAAGCAA
ACCGGAGAAG GAACGCTGAA CGAAAGTCTG GCCACCACTT ATTACCTGAT TGCGGGTTCT
TATGCACCGG GCGAGGTGCC GCGTGAAAAA ACGCTTGCCT TGGGTACTGC AGGCGTTGGA
AACATCATGC TTGATTTTGA TGGAAAGCTG ATCATAAAAA ATGCTGATGG CAACTTATTG
CTTTATCCGG TAAATGTAAA CGGCACCTTT GGTACACCAG CACAAATTGG TGTGGGATGG
AACATTTTCA ATATGATCTT TTATTTCGGT GGCGACCGGC TGATTGCCAG ATGGGCAAGC
GGCGGACAGG ACATCAGCCA ATATGCCATA AGCAAATCGG GAGCTTTCGG CGGCAGTAAA
TCGATCGGTC AGGGCTGGGG GATATTTACT AAAATCATTC CGTTCAAAGG GCTTTTACTT
GGCGTAGACG GCGCTGGCGA TATGACCATG TATCCCCTGG ATGTTGCCGG CAACTTTGAT
TTTGGCAGAA TTAAAAAAAT CGGTACCAAA TGGAACGATT ACAAACAGGT TTTCGCTTAC
CAAAATTCCC TGATCGCGAT TGAGCCAGGT GGAGATATGT ACCAGATCCC TTTATCAGAC
AGCGGTGTAT TTGGCTCCAG AAGAAAAGTG GGTAATGGCT GGGATATGTA TGTAAATGTA
TTCGCATCTG GTGACGACCT GTTAGCGCTG GATAGCAACG GCGACTTATG GCGCTACCGC
TTCAACCCAA TCGGTTTCTG GCCCTTAAAG AAATAG
 
Protein sequence
MVATHKTNNM KKIKLAASIG ILMMLAACSK DVKVDLDLPK AEQLKKVYMP QAANPVEVRS 
VSIVDKDTSF VYSAFLGGPK PATGNITVGF TVMPEKVTAY NQQNSTNYQL MPQGSYVLEA
LTSVIPAGGQ STGRLNLSIK TKGFLNPFET YLLPVTVTKQ TGEGTLNESL ATTYYLIAGS
YAPGEVPREK TLALGTAGVG NIMLDFDGKL IIKNADGNLL LYPVNVNGTF GTPAQIGVGW
NIFNMIFYFG GDRLIARWAS GGQDISQYAI SKSGAFGGSK SIGQGWGIFT KIIPFKGLLL
GVDGAGDMTM YPLDVAGNFD FGRIKKIGTK WNDYKQVFAY QNSLIAIEPG GDMYQIPLSD
SGVFGSRRKV GNGWDMYVNV FASGDDLLAL DSNGDLWRYR FNPIGFWPLK K