Gene Phep_3571 details


Gene Information

Locus tag: Phep_3571
Symbol:
ID: 8254693
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4249364
End bp: 4250830
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 45%
IMG OID: 644937223
Product: Microcystin LR degradation protein MlrC
Protein accession: YP_003093824
Protein GI: 255533452
COG category: [S] Function unknown
COG ID: [COG5476] Uncharacterized conserved protein
TIGRFAM ID:
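
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
start_bp = 4249364
end_bp = 4250830
gene_length_bp = 1467
protein_length_aa = 488


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Compare the derived values with the ones listed in the record.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under those assumptions, both comparisons agree with the listed gene length and protein length.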


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.105473
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATTAA AAGTAGCCGT GATGGGGATT ACCCACGAAT CCAATACGTT TGTAAAAGAA 
CCTACAACGC TTCGTCAGTT TAGAGAAGGA CACTGGATGA AGGGCGAAGC TTTGCTGAAT
GAATACCGTG GTGCGTTTCA TGAACTGGGT GGCATGATCG AAGTGCTGGA ACAGGAAGGT
ATTGAGATTG TTCCGGTGAT GTATGCGGAA GCAACACCAG GGGGTATAGT TTCGACGGTT
ACTTACGAGA TTTTGTTGCA GGAGTTGTTT ACAGCATTGG ATAGTGCGCT TCCGGTGGAT
GGATGCCTGG TGGTTCCGCA TGGAGCCGGG GTATCCGAAT CATTTCGGGA TATGGATGGG
CATTGGTTAA GCCTTGTCCG GGAAAAGTTG GGCGATGGCA TTCCCATTAT TGGAACGCTT
GATCCACATG GTAATGTGAG TGATCAGATG ATTGCATCAA CCAATGCATT GGTTGCATAT
AAGACCAATC CGCACATCGA CCAGCGGGAA ACTGGGCGTT TGGCCGCAAA ATTATTGGTA
AGTACGCTGA AGAAAAAAAT AAGACCCCTT CAGCACTTAA TTCAGCTACC GATGGCCATA
AGTATAGAGC AGCAGTTTAC GGGCAAAGCG CCATGTAAAA GTTTATATGC TTTGGCTGAT
GTGCTTAGCC GGCAGCAAGG TATTTTGTCG GTCAGTATCA TGTTGGGCTT TCCTTATGCA
GATGTGGAAG AAATGGGTTC TTCTATCATT GTGGTTACTG ATGACGACGA ACAGCTGGCC
ATAGGCACCG GTAAAACGCT CGAAGCTTAT ATGATGGTAC ATAAACAAGA ATTTTTTGGG
CTTAAACAAG ATATAGAGGA TTTATTGCCG AGGATTGAAA AGAGTGAAAA ACCTGTTTTA
CTACTCGATA TGGGAGATAA TGTGGGAGGA GGAGCCCCAG GAAACAGCGC CTATTTGCTG
AAAGCGTTGG AAGCTAGGGG TAAGATCAGA TCATTTATAT GTCTTTGTGA TCCTACAGCA
GTGCAGCTTG CAACCCGGCA TGCTGTGGGA GATACTTTTG AGATGGTTAC AGGAGATCAT
GAGGATTTAA CAATGAACTA TACCAGCCGG GTAAAATTAC TTTATACTGG CGATGGAAAG
TTTAAGGAAA CTGTGCCCCG GCATGGGGGG CAGGTTAATT TTGATATGGG TAAAATTGCT
CTCGTAGTTA CATCCGCAGG AAATACAATA ATGCTGACTT CCCTGCGTGT TCCTCCTTTT
AGTTTGAAGC AATTGACCAG TTTTAACATA ATGCCTGAGG AGTTTGATGT GCTTGTTGCT
AAAGGGGTAA ATGCGCCAAT TGCCGCTTAT GCTACAGTTT GCAAGACTTT GCTACAGGTT
AATACACCGG GGGTTACCTG TGCAGATATG ACCCGTTTTA ATTATATGAA CAGAAGGAGG
CCTATATTTC CTTTTGAAGA GATTTGA
 
Protein sequence
MRLKVAVMGI THESNTFVKE PTTLRQFREG HWMKGEALLN EYRGAFHELG GMIEVLEQEG 
IEIVPVMYAE ATPGGIVSTV TYEILLQELF TALDSALPVD GCLVVPHGAG VSESFRDMDG
HWLSLVREKL GDGIPIIGTL DPHGNVSDQM IASTNALVAY KTNPHIDQRE TGRLAAKLLV
STLKKKIRPL QHLIQLPMAI SIEQQFTGKA PCKSLYALAD VLSRQQGILS VSIMLGFPYA
DVEEMGSSII VVTDDDEQLA IGTGKTLEAY MMVHKQEFFG LKQDIEDLLP RIEKSEKPVL
LLDMGDNVGG GAPGNSAYLL KALEARGKIR SFICLCDPTA VQLATRHAVG DTFEMVTGDH
EDLTMNYTSR VKLLYTGDGK FKETVPRHGG QVNFDMGKIA LVVTSAGNTI MLTSLRVPPF
SLKQLTSFNI MPEEFDVLVA KGVNAPIAAY ATVCKTLLQV NTPGVTCADM TRFNYMNRRR
PIFPFEEI
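
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the database's own method: it builds the codon-to-amino-acid mapping used by translation table 11 (which assigns the same amino acids as the standard code and differs only in the start codons it allows) and translates the first and last blocks of the gene sequence. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last blocks of the gene sequence, copied from the record.
first_block = ("ATGAGATTAA AAGTAGCCGT GATGGGGATT "
               "ACCCACGAAT CCAATACGTT TGTAAAAGAA")
last_block = "CCTATATTTC CTTTTGAAGA GATTTGA"

print(translate(first_block))  # first 20 residues of the protein
print(translate(last_block))   # last 8 residues; the final TGA is the stop
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the listed GC content.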