Gene Phep_2472 details


Gene Information

Locus tag: Phep_2472
Symbol:
ID: 8253579
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2866291
End bp: 2867394
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 47%
IMG OID: 644936122
Product: UBA/THIF-type NAD/FAD binding protein
Protein accession: YP_003092738
Protein GI: 255532366
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
TIGRFAM ID:
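
The start and end coordinates, the gene length, and the protein length in this record are mutually consistent. The short sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the record states neither convention explicitly, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2866291
end_bp = 2867394

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1104 bp) and Protein Length (367 aa).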


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.935376
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCCGT TTGGAAAGAT TGGGACAACA GTTTGGTTTA AGGAAATGGA AAATCAGAAA 
CGTTACGACA GGCAGGTCAT CCTTCCGGAA ATTGGCCTGG ATGGGCAGCA AAAACTTAAA
AATGCGAGGG TGCTTGTTAT TGGGGCGGGT GGCCTGGGCT GTCCCCTGCT CCTGTACCTT
GCCGCTGCAG GTATAGGCCA TATGGGCATA GCCGACCATG ATGTGGTGGA CGAGAGCAAC
CTGCAGCGGC AGATTTTATA CCAGATGGCG GATATAGGAG GCTTAAAAGC CGAAATTGCG
GTAAACAAGC TTGGCTTGCT GAACCCGGAT GTGGATTTCA GGGCTTATCC CTTTAAACTG
GGTATGGAAA ATGCGGCTGA ACTGATTGCT GTCTACGACC TGATCATCGA TGGTTCTGAC
AATTTCCCTA CCCGCTACCT GGTAAATGAT ACCTGTGTGG CCTTAAATAA AACACTGGTA
TTTGGTTCTA TATTTCGATT TGAAGGGCAG GTAACGGTAT TTAATCATAA AGGGGGACCA
GATTACCGTT CCCTTTACCC CGAGCCACCG GCAGCAACTG AAATGCCCAA TTGTGCTGAA
GCCGGGGTAA TTGGTACGTT GCCCGGAATT ACAGGCACCT TAATGGCCAA TGAGGCCATC
AAGATCATTT GTGGCTTTGG TGAGGTATTG TCGGGGAAAC TGCTGCTGTT TAATGCACTG
AACAATGAAA TGCAGGTATT TGGCTTTGGC AGTCACAATG TGCAAAACCG GCGCGCTGTA
GGGCAGAAAG GAAAGATGGA TGTACCGGCC CTGCCCGGAC CACATGAAAT CAGCCCCGGT
CAGCTCGCAG AATGGAAAAA TGAAAACATA GCGTATCAGC TGATAGACGT AAGGGAAGCT
TATGAATACG AAGAATACCA TATTGGTGGC ATCAATATCC CCCTTTATGA ATTAAGCCAG
CACATCCCCG CCCTGCTTCA GTATGAAAAG ATCGTTTTTT GCTGTGCTTC AGGAACAAGG
AGCAAAATTG CACTAAACCT GATGAAGAAT AATCATAAAG CTGAGTGCTA TACTTTAATC
GTTTCTGCAA ATCAGCAAAC ATAA
 
Protein sequence
MKPFGKIGTT VWFKEMENQK RYDRQVILPE IGLDGQQKLK NARVLVIGAG GLGCPLLLYL 
AAAGIGHMGI ADHDVVDESN LQRQILYQMA DIGGLKAEIA VNKLGLLNPD VDFRAYPFKL
GMENAAELIA VYDLIIDGSD NFPTRYLVND TCVALNKTLV FGSIFRFEGQ VTVFNHKGGP
DYRSLYPEPP AATEMPNCAE AGVIGTLPGI TGTLMANEAI KIICGFGEVL SGKLLLFNAL
NNEMQVFGFG SHNVQNRRAV GQKGKMDVPA LPGPHEISPG QLAEWKNENI AYQLIDVREA
YEYEEYHIGG INIPLYELSQ HIPALLQYEK IVFCCASGTR SKIALNLMKN NHKAECYTLI
VSANQQT
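
The gene and protein sequences above are printed in space-separated blocks of ten. The sketch below shows one way to strip that layout, compute GC content, and translate a stretch of the gene sequence with translation table 11. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical and not part of any database tool. The amino acid string is the table 11 codon assignment in T, C, A, G order. The example input is only the final line of the gene sequence, not the full gene.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# Final line of the gene sequence as printed in this record.
tail = "GTTTCTGCAA ATCAGCAAAC ATAA"
print(translate(tail))             # VSANQQT*
print(round(gc_percent(tail), 1))  # 33.3
```

The translated tail matches the last seven residues of the listed protein sequence, followed by the stop codon. Running `gc_percent` on the full gene sequence would give a value to compare with the GC content field.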