Gene Phep_1049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1049 
Symbol 
ID8252143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1232981 
End bp1236235 
Gene Length3255 bp 
Protein Length1084 aa 
Translation table11 
GC content39% 
IMG OID644934702 
ProductYD repeat protein 
Protein accessionYP_003091331 
Protein GI255530959 
COG category 
COG ID 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGC ATTATTGTAT TCTATTATTC ATTTTTAGTT GCCTGTCTGT TGTTAAGGCA 
CAAACTAACT ATAATAAATT GCCGGTTTAT ATCCCCCCGG CACCTGATGC TGCAGCCCTT
TCCAAATATG GCAATCTGGA TGTGGGTTTA CAGACAGGTT CATTAAATTT TAAAGTGCCC
TTGCTTACTT TAGAGGGCAA TCAGTGGGCA ATGCCTATTG ATATCCAGTA TTTTACTTCC
GGGGTTAAAG TAGACCAGAT TGCATCCAGA GTTGGCATGG GCTGGGCTTT AAGTGCAGGT
GGGGTGGTAA CCAGAAGTGT TAACCATAAC CCGGATGAAT TAAGTACCCG ATCTACCCTG
CCTTCAGACT GGAACAGTTT TGGTCAGAAT TTCCTGACTT ATCTAGAGAA TGTAGCTGTG
AGCAATAGCA TGGATACTGA GCCAGATGAA TTCTCGTATA ATTTTAATGG CTATTCCGGT
AAATTTATAT TGGATGCCAA CAAAAACCCA ATATCAATTC CGCATTCCAA TCTGAAAATA
TCTTTTATTT TCAATAGTAC AACCTCATCA TCAGTAGTGA TAAAAACACC GGATGGGAGT
GTATATTATT TTGAGGATAC CGAGTATACT TCAGCTACCT CGGTGAGTGG TGGTGTAAAC
CAACCTATTG GCGCCTCCGT TCCTACCTCC TGGTACCTGA CAAAGGTTGT CCTGCCGAAT
AAAGAGACCG TTAATTTTAC CTATCAGGGT ATAGCTTATG ATTATGTAGC CGGTATTTCA
CAAACGTTAA CCAGGTCTTT TGATATTGAA AACCAGAACC CCTGTCCCAA TCTGCAATGT
GACATTGTAG ACCATGAAAA TACCAGTTTA AGTTTATTGT ATATAAATGG AAAAATGCTC
TCAAAGATCA AATTCAGGTC TATGGAAGTG AAGTTTGATT ACATCACGCG TTTGGATTTA
CCATCAGCAA ATAATGGTGA TAAATTGCTT GAAAAAATTA CCCTTAAAAA TAATAATCAG
GTATTGAAAT CATTTCAATT TGATTATCAA TATGGAATAT CTTCCAGTAC TTATGAGTCC
TGGAATGGCA ATGAGACCAG TTTAAAATAC AGGCCGTATT TGACTTTTTT AAAACAAAAG
GGAAAAAATG GAGAGGAAGA GGGAAAACAT ACATTTACCT ATGAGGACAT TAATGCAATG
CCTCCAAGGT TGTCTTTTTC GCAGGATGTA TTGGGCCACT TTAATGGGAA GTTAAATTCA
AGTTTAATCC CAAAACCAGT CAGTACAATT GATCAGCAGA TGTTTTTGAG AGCTACTGCC
GACAGGGATC CAGATTGGGG AAGTTCAGTT AAAGGGGCCT TGATTAAAAT TGAGTACCCT
ACAGGGGGTA CTGATCGTAT TTTTTATTCA TTGAACGATT ATTTTACCAC TAAAACGGTG
CAGCAACGAG AAAGCCTGAA TTTAAGTGTC CCAAGAACGA ACACCATTTA CCCAATAGTA
AAAACGTCTC CCAATTTTAC AACTACCAAT TCTCAGACGG TGACATTGGC CGCCCATATG
TGGATTGAAA TGGGTCCAGA TGGTTCATCA TCTTATGATC CGGATCAGGA CCTGATGATC
GTAGAAGTGT TGCGCAGTTC AGATGGGCAA TTGGTTTTTT CAAAAACGCT TAAAGCGGGG
CAAGATGTTT CAGAAAACCT GACCGGGATT GCATTAACAA CATCTTACTT TTACAGAATT
ACCTGTCTTG TTCCAAATTT AAGATGTACT TCAGTGATTT CTTATGATGG GGTTCCTCTG
GAAGTTACAA CAAATGTCCA GGTGCCCGGG CTTAGGGTGA GCAGTCTGAT TAGTAAGAGT
AGTGCCGCTG AGCCGGAGTT GGTTAAAAGT TTTTATTATG CGAAGTTATC AGATTTAAGT
AAGTCATCAG GAAATGTTGG ATATGCCCCA TTGTTATTTC AGGATGTTAC CTCTTATAAC
TCGGGTTTTT GTGCGGCAGA ATCATCTATG TTGGCCTATT TCCCCTGCCG AAATATGGTT
GCGTATTCTA ATTCTTTGGT TAATCAGTAT CCTTATTCAC AAAATTATAT GTATTACAGT
AATGTGATTG AAGCACTGGG ATCAAATCTG GAAAATGGTG GAACTTCCCA TGAATATATA
GTCGAAAGAG ATTCCCGGTC TATGCCTGTC ATAGGGAATA ATGTAACATT AGGTGTAAAA
TTATCTAATA ATGGCTTTTC AAATGGTCTT GAAAAAGAAC AGATCATTTT TAAAAAAAAG
GACAGCACTT TTGTTAATTT AAAGAGGACT GTAAATCATT ATCATGAAAG TGTTCGGAGC
ATTACTGATG CATACGTAAT TAATAAAACC TATGATTACC CTACCCACTC CAATCCCCCT
GCTTATCGGG AGTTTGAGGG TTTTAATATA CACAAATACC GGTTTTACTC TGCCTGGGTT
TATAAGGATA CCAGCACTGT TTATGATTAC AATGAGAACG GTGGCTATGT AAAAAGCAAA
ACAGTGTATG AATATGGCAA CCCTTTACAT ACGCAGCAAA CCAGGGTAAG CACAACTGGC
AGTGATCAAA AAACGATAGA GATCAAATAT AAATACCCTC ATGAAAAGGT TTCCTTAGGT
GAAGACCCAG GTGGTGTATA CCAGGCAATG GTTAATGATC ATCTGATTCA GCCGGTTGTT
GAAGAGCAGC GATTAAAAGA TGCAGTACAA ACCGATTTTA CACGCATTAA TTATACCCAG
CCTTTTACCC ATTTGTTTTA TCCGGATAGT GTGCAGGTGC AGTCGGTTGC AGGCAGCCCT
GTCGAAAGCA GGATACGTTA CCATAAATAT GACGATGCCG GGAATGCATT AAGTGTGTCG
CAGGAAAAAG GCAGCAAAAT ATGTTACGTA TGGGGATACG GAAGAAGTAA GCCTATAGCT
GAAATTAAAA ATGCAGATTA TGCCACTGTA GAGGCTGTTC TTGGTGGATT GGGGGCAATA
AACACATTTT GTGGCAGCTA TACGAAAACC GATACAGAGG TAAGAAATTT TCTGGCCGTT
TTAAGGACGG ATTCTAGGCT GAAGGATGCC CAGGTAGCTA GTTTTACCTA TGATCTTTTT
ATGGGGACAA CCAGCGCTAC CGATGCCAAA GGCATGACCA CCTATTATGA ATACGATAGT
TTCCAGCGGT TAAAGTATAT CAAAGACCAG GATGGGAACA TTGTCAAATC TTACGATTAT
CATTATAAAC CTTAA
 
Protein sequence
MKKHYCILLF IFSCLSVVKA QTNYNKLPVY IPPAPDAAAL SKYGNLDVGL QTGSLNFKVP 
LLTLEGNQWA MPIDIQYFTS GVKVDQIASR VGMGWALSAG GVVTRSVNHN PDELSTRSTL
PSDWNSFGQN FLTYLENVAV SNSMDTEPDE FSYNFNGYSG KFILDANKNP ISIPHSNLKI
SFIFNSTTSS SVVIKTPDGS VYYFEDTEYT SATSVSGGVN QPIGASVPTS WYLTKVVLPN
KETVNFTYQG IAYDYVAGIS QTLTRSFDIE NQNPCPNLQC DIVDHENTSL SLLYINGKML
SKIKFRSMEV KFDYITRLDL PSANNGDKLL EKITLKNNNQ VLKSFQFDYQ YGISSSTYES
WNGNETSLKY RPYLTFLKQK GKNGEEEGKH TFTYEDINAM PPRLSFSQDV LGHFNGKLNS
SLIPKPVSTI DQQMFLRATA DRDPDWGSSV KGALIKIEYP TGGTDRIFYS LNDYFTTKTV
QQRESLNLSV PRTNTIYPIV KTSPNFTTTN SQTVTLAAHM WIEMGPDGSS SYDPDQDLMI
VEVLRSSDGQ LVFSKTLKAG QDVSENLTGI ALTTSYFYRI TCLVPNLRCT SVISYDGVPL
EVTTNVQVPG LRVSSLISKS SAAEPELVKS FYYAKLSDLS KSSGNVGYAP LLFQDVTSYN
SGFCAAESSM LAYFPCRNMV AYSNSLVNQY PYSQNYMYYS NVIEALGSNL ENGGTSHEYI
VERDSRSMPV IGNNVTLGVK LSNNGFSNGL EKEQIIFKKK DSTFVNLKRT VNHYHESVRS
ITDAYVINKT YDYPTHSNPP AYREFEGFNI HKYRFYSAWV YKDTSTVYDY NENGGYVKSK
TVYEYGNPLH TQQTRVSTTG SDQKTIEIKY KYPHEKVSLG EDPGGVYQAM VNDHLIQPVV
EEQRLKDAVQ TDFTRINYTQ PFTHLFYPDS VQVQSVAGSP VESRIRYHKY DDAGNALSVS
QEKGSKICYV WGYGRSKPIA EIKNADYATV EAVLGGLGAI NTFCGSYTKT DTEVRNFLAV
LRTDSRLKDA QVASFTYDLF MGTTSATDAK GMTTYYEYDS FQRLKYIKDQ DGNIVKSYDY
HYKP