Gene Phep_3569 details

Gene Information

Locus tag: Phep_3569
Symbol:
ID: 8254691
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4247443
End bp: 4248513
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 44%
IMG OID: 644937221
Product: Kelch repeat-containing protein
Protein accession: YP_003093822
Protein GI: 255533450
COG category: [S] Function unknown
COG ID: [COG3055] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03548] cyclically-permuted mutarotase family protein
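
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the gene length includes a single stop codon; neither assumption is stated in the record itself. The function names span_length and coding_codons are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4247443
END_BP = 4248513
GENE_LENGTH_BP = 1071
PROTEIN_LENGTH_AA = 356


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1071
print(coding_codons(GENE_LENGTH_BP))   # 356
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length fields.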


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.396021
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.14422
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTGC TATCTATTTA TTTATTCTTA TTGGGAAATA TAATGTTGAC TACTATAAAT 
TCAAAAGCCC AGGTTATGCC GGTGTTCAGT GAGCTTACAT CTTTACCCGA TTCCGAGGGC
TATGCAGGAA TGTTTGCCGG GGTTAGCAAT GGAAGGTTAT TTTGCCTTGG CGGTGCCAAT
TTTCCTGATA AACGACCCTG GGAAGGCGGT AAAAAGAAGT GGTATGATGA AATCTACATG
TTTCAGGAAG GCAAGGACTG GGTAAAGCTG GCTGATAAAC TACCATCTCC ACTTGGTTAT
GGAATAACTG TCAGCTATAA AAATCAATTT ATAATTGTGG GTGGTAACCA TGCAGCAGGA
TTTTCGGACA AAGTATATGG ATATGAATGG ACGGATGGCA GATTAAAAAT GGTCCATTAT
CCGCAATTGC CTGTTCCCCT AGCCAATATG GCAGGAACAC TTGTTGGCCA GCTAATCATC
CTTGCCGGGG GGAATAGCTC TGCTACAGGC AGGGCAGGTA AACAGTGTTA TGTGCTGGAT
CTTGAAGCGA TTGACAGTGG ATGGTCTGCA TTGCCATCCT GGCCAGGAAG GGAACGGATG
TTACCTCTAT GTGCTGTGTA TGGTGGTATG TTTTATTTGT TTGGCGGAGA AACTACTGGG
ATTAATTCCT TAAGTCAACA TTACCGGCTT ATCCTGGATG ATGCCTACAG CTTTAAACCA
AAAAAGGTGG ATGGAAGATG GACCGGGACC TGGACTACAC TTTCTCGTAT GCCTAAAGGG
CTGTCAGCCG GTGGTAGTCC ATTACCCGTA TTGGAAAATG GTGACGTAGT GTTTTGGGGT
GGTGTTGATG CGTTAACCTC TTTGCATACT GATCCGCTTT CACATCCTGG AATATCGGCC
GATGTGCAGT TGTATAACCT TAATAGCGAT ACCTGGAAAT ATGCAGGTAA AAAACTGGGA
ATTGCTGCTC CTGTTACTTT GCCTGTTGTA AACTGGAACG GGCGATGGCT TTACATTAGC
GGGGAAATAA AACCTGGGAT AAGGACCAAT AAAATTTATG AGTTGAAATA G
 
Protein sequence
MKLLSIYLFL LGNIMLTTIN SKAQVMPVFS ELTSLPDSEG YAGMFAGVSN GRLFCLGGAN 
FPDKRPWEGG KKKWYDEIYM FQEGKDWVKL ADKLPSPLGY GITVSYKNQF IIVGGNHAAG
FSDKVYGYEW TDGRLKMVHY PQLPVPLANM AGTLVGQLII LAGGNSSATG RAGKQCYVLD
LEAIDSGWSA LPSWPGRERM LPLCAVYGGM FYLFGGETTG INSLSQHYRL ILDDAYSFKP
KKVDGRWTGT WTTLSRMPKG LSAGGSPLPV LENGDVVFWG GVDALTSLHT DPLSHPGISA
DVQLYNLNSD TWKYAGKKLG IAAPVTLPVV NWNGRWLYIS GEIKPGIRTN KIYELK
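
The gene and protein sequences above are related through translation table 11, and the GC content field is a simple base count. The following is a minimal sketch of both calculations in plain Python, not the database's own pipeline. The helper names clean, translate and gc_fraction are hypothetical. The codon table uses the standard amino-acid assignments, which table 11 shares with the standard code; the alternative start codons that table 11 permits are not handled, which does not matter here because the gene begins with ATG.

```python
# Codon table built in the conventional TCAG order.
# Amino-acid assignments are those of the standard code, shared by table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First six codons and last four codons of the gene sequence in this record.
print(translate("ATGAAACTGC TATCTATT"))  # MKLLSI
print(translate("GAGTTGAAATAG"))         # ELK*
```

The two fragments translate to the first six and last three residues of the protein sequence listed above, followed by the stop codon. Pasting the full gene sequence into translate and gc_fraction would allow the Protein sequence and GC content fields to be checked in the same way.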