Gene Phep_1371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1371 
Symbol 
ID8252471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1629736 
End bp1632675 
Gene Length2940 bp 
Protein Length979 aa 
Translation table11 
GC content45% 
IMG OID644935024 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_003091647 
Protein GI255531275 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.940819 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTTAA ATGTCCACTC TTCCTATAGC CTGCGTTATG GCACCATGTC TGTAGAACAG 
CTTGTAAAAG AGGCAGTTTC TTTGGGAATT GAGCAAATGG CCATTACAGA TATCAATAAC
TCTACCGGGG TAATGGAATT TATGCGGGAA TGCAGGGCGC ATGGTGTCAG GCCGATCGGT
GGAATAGAAT TCAGGAAGCA GAATAAATTG TTATATATCG GTATAGCCAG GAACAAGGAA
GGAATGAAGG AACTGAACGA CTTTTTGACC TTTCATAACT TCAGCAAAAA GGAATTGCCC
GCCAGGGCAG ATGCGTTTAA AAATGCCTGT GTGATCTATC CTTATCAAAG TGGCATAGCA
GTCGGGACAG ATGATTTTAT AGGCATCCGG CCAGGGCAGT TGAACCTGCT TTACGGAAGG
GACCTGAGCC AGCTGAAGGA TAAGCTGGTG GTTTGGCAGC CGGTAACGGT TTACAACCGT
TTACACTACC GCCTGCATGA ATACCTGCGG GCAATAGACC TGAATACACT GCTTACTAAG
GTTGATGAAG GTAATAAATG CAGGCCGGAC GAGCATTTTT TGCCCCCCGG TGAGCTGAAG
CATTTGTATG CCGCTTATCC TTTTATTCTT GAAAATACAG AAAAATTAAT GGCGGGTTGT
ACCCTTAATT ATGTAAACAA AGAAAGAAGG AACAAAAAGA CTTACAAAGG GAGCAAGGCA
GAAGACAAAC TGCTGCTGGA ACAGCTGGCT TGGGCCGGAC TTGAGCAGCG TTATAAAGGT
GAAAATCATG AGGCCGCAGA ACGGCTGAAA AAAGAACTGG AAGTTATCGA AGAGCTTGAT
TTCTTTGCTT ATTTTCTGAT CACCTGGGAC ATCATCCGCT ATGGCAAAAG TAAGGGCTAT
TATCATGTAG GCCGGGGATC AGGGGCAAAC AGTATTGTGG CTTACTGCCT GTTCATCACC
GATGTAGACC CGATAGAGCT GGATCTTTAT TTTGAGCGGT TTCTGAATAA GAACCGCAGT
TCGCCCCCGG ATTTTGATAT AGATTATTCA TGGGATGAAA GGGAAGATAT ACAACAGTAT
ATTTTTAATA CTTATGGTGA AAAGCATACG GCCATGCTGG GTACGATGAG TTCATTTAAA
GACCGCTCGG TCTTCCGGGA GATCGGGAAG GTAATGGGAT TGCCCAAAGC AGAGATCGAT
GATTTCCTAG ATCCTGTAAA ACAGGAAGAA CACAAAAAAA ATCCTGTTTA TAAAAAGATC
ATGGCTGCGC ATGGCATGAT GGAGAAAATG CCTAACCAAC GTTCTATACA TGCCGGGGGG
ATATTGATTT CGGAAGATCC AATCACTTAT TATACCGCGC TGGACCTGCC CCCAAAAGAT
TTTGCCACGG TACAATGGGA CATGTACGAA GCAGAAGCTA TAGGGCTGGA TAAATTTGAT
ATTCTGAGTC AGCGGGGCAT TGGCCATATT AAAGAGGCCG TGCAGCTCAT CAGGCAAAAT
ACGGGTGATG TGGTCGATAT CCATGATACC AGGAGCTTGC TGAAAGATCC CAGACTTAAT
GGTTTGCTGA AAGAGGGTAG CAGCATTGGC TGTTTTTATA TCGAATCGCC CGCCATGCGG
CAATTGCTGG TCAAACTGGC TTGTGAAGAC TACCTTACAC TGGTGGCAGC CAGTTCCATT
ATCAGGCCCG GTGTAGCGCA ATCTGGTATG ATGGCGACCT ATATTTACAA CTATCACCAT
CCTGAAGAAG TGAAATACCT GCACCCGATT ATGGAAGAGC ACCTGAAAGA TACCTATGGG
GTAATGGTTT ACCAGGAAGA TGTGATCAAA ATCTGTTACC ATTATGGAGG ACTTGATCTG
GCAGATGCTG ATATCCTGAG AAGAGGAATG AGTGGAAAGT ACCGTTCAAA AAAGGAATTT
GACCGCCTGA TAGAGAGTTT TTTTATCCAT GCAAAGAAAG AGGGCAGGGA TGAGGAGGTA
ACCAAAGAAA TCTGGAGGCA GGTGGCTTCG TTTGCCGGAT ACAGTTTTTC TAAGGCCCAT
TCCGCAAGTT ATGCGGTAGA GAGCTATCAG AGCTTATTTT TGAAGACTTA TTACCCTAAA
GAGTTTATGG TTGCTGTACT CAATAACTAT GGAGGCTTTT ACCAGCGTTG GGTATATGTG
CACGAGTTAC GTAAAGTAGG GGCCGTAGTA CACCTGCCCT GTGTAAACCA CAGTGAAGAT
GTTGTAAGCA TCAATGGGTC AGATGCCTAC CTGGGTCTGA TTGGTATTCA GGGGCTGGAG
AGCCGGCATA TGACCCTGAT CCCCAAAGAA CGGAGGGCCA ATGGTCTGTA TAAAGGATTG
GAAGATTTTG TACGGCGTAC AGGAATCACA CTGGAACAGG CCATCCTGCT CATCAGGATC
GGTGCATTGC GCTTTACAGG TTCCAGTAAA AAAGCCTTGC TCTGGGAGGT GTACAGTTAC
CTGGGCAATA AGCAACCGGA AGTACCCTCA CAGGAACTTT TCCGGATGGA AAGCAAAGCC
TGTGTGCTTC CGCCCCTGGA TTCAGACAAA CTGGAGGATG CATATAATGA ACTCGAACTG
CTGGGCTATC CGGTAAGTAT GGGAATGTTT GACATGCTGA AAACAGACTA CAGAGGTGAT
GTGAAAGCTT CAGGTTTAAG AGCATATATT GGTCAAACTA TAAAAATGGT AGGTTTGTAC
GTCTGCGAAA AAACAGTGCA CACCAAAAAC AACAAAAAAA TGTGGTTCGG TACCTTTTTA
GACGCCGAGG GGAATTTCTT TGATACCACA CATTTCTCTA CCCATACACC TGTATACCCT
TTCAGGGGAA AGGGCTGTTA CCTGATATTG GGCAAAGTGG CGACAGATTT TGGCTTTCCA
AGTATTGAAG TGTTCCGCTT TGCCAAATTG CCCCTGGTAG ATAACCCGGT AATGGCTTAA
 
Protein sequence
MFLNVHSSYS LRYGTMSVEQ LVKEAVSLGI EQMAITDINN STGVMEFMRE CRAHGVRPIG 
GIEFRKQNKL LYIGIARNKE GMKELNDFLT FHNFSKKELP ARADAFKNAC VIYPYQSGIA
VGTDDFIGIR PGQLNLLYGR DLSQLKDKLV VWQPVTVYNR LHYRLHEYLR AIDLNTLLTK
VDEGNKCRPD EHFLPPGELK HLYAAYPFIL ENTEKLMAGC TLNYVNKERR NKKTYKGSKA
EDKLLLEQLA WAGLEQRYKG ENHEAAERLK KELEVIEELD FFAYFLITWD IIRYGKSKGY
YHVGRGSGAN SIVAYCLFIT DVDPIELDLY FERFLNKNRS SPPDFDIDYS WDEREDIQQY
IFNTYGEKHT AMLGTMSSFK DRSVFREIGK VMGLPKAEID DFLDPVKQEE HKKNPVYKKI
MAAHGMMEKM PNQRSIHAGG ILISEDPITY YTALDLPPKD FATVQWDMYE AEAIGLDKFD
ILSQRGIGHI KEAVQLIRQN TGDVVDIHDT RSLLKDPRLN GLLKEGSSIG CFYIESPAMR
QLLVKLACED YLTLVAASSI IRPGVAQSGM MATYIYNYHH PEEVKYLHPI MEEHLKDTYG
VMVYQEDVIK ICYHYGGLDL ADADILRRGM SGKYRSKKEF DRLIESFFIH AKKEGRDEEV
TKEIWRQVAS FAGYSFSKAH SASYAVESYQ SLFLKTYYPK EFMVAVLNNY GGFYQRWVYV
HELRKVGAVV HLPCVNHSED VVSINGSDAY LGLIGIQGLE SRHMTLIPKE RRANGLYKGL
EDFVRRTGIT LEQAILLIRI GALRFTGSSK KALLWEVYSY LGNKQPEVPS QELFRMESKA
CVLPPLDSDK LEDAYNELEL LGYPVSMGMF DMLKTDYRGD VKASGLRAYI GQTIKMVGLY
VCEKTVHTKN NKKMWFGTFL DAEGNFFDTT HFSTHTPVYP FRGKGCYLIL GKVATDFGFP
SIEVFRFAKL PLVDNPVMA