Gene Phep_4000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4000 
Symbol 
ID8255134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4830365 
End bp4831330 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content44% 
IMG OID644937664 
Productprotein of unknown function DUF6 transmembrane 
Protein accessionYP_003094253 
Protein GI255533881 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTCAC AATTAAACAA AAAAGCCTCT CCCTTAATCG TTATCCTGGC CTTTGCCATA 
GTATATATCG TTTGGGGCTC TACTTACTTT TTTATCCAGA AAGCCCTCGC CGGCTTTCCT
CCTTTTATCC TTGGTGCCTT CAGGTTCCTT GCGGCCGGAC TGTTGTTAAT AACATGGTGC
AGCATTAAAG GGGAAAAGAT TTTTGATAAA AAAAGTATCA GACAGGCTGC CATAGCGGGC
ATACTGATGC TTGGGATAGG CAACGGACTG GTGATCTGGG TAGAACAATC TATACCAAGC
GGATTGGTTG CCATCCTGGT TTCATCTGCA GCCATGTGGT TCGTAATTCT GGACAAACCG
AAATGGAAAG AAAACCTGCA AAGTACCTCT ACAGTAATGG GATTGATTAT AGGGTTTATA
GGTGTTATCC TACTATTTGC AGAACAGGTA ATGCATACGC TGAACAACAA CCAGAGCAAC
ACTCAAATTG TAGGTATTGT ACTTTTACTG CTGGCACCAA TAGGATGGGC AGCGGGTTCC
CTGTATTCAA AATACAATAC GACCACTACC GTTTCGGTAT CTGTAAATAC CTCCTGGCAA
ATGCTGGCCG CCGGTATCGC TTTTATGCCG GGTATCCTGT TCAATTCAGA ACTTAAAGAT
TTCGATTGGC ATATGGTATC AGCTGATGCC TGGTTATCAG TTGGCTATCT GGTCATATTT
GGCTCTATTG CTGCTTTTAG TGCGTACGTA TGGCTATTAA GTGTACGCCC GGCAACACAG
GTGAGCACGT ATGCCTATGT AAACCCCGTA GTAGCTGTTT TGCTGAGTAT ACTTTTTACC
AGCGAAAAGG TCACTATAAT CCAGGTTGCC GGATTGGTGG TCATACTGGG CAGTGTATTG
CTCATCAACC TGGCTAAATA CAGAAAAGAG CACCAGCTTA AACAAAAAAC TGCCTACTCG
AAATAA
 
Protein sequence
MPSQLNKKAS PLIVILAFAI VYIVWGSTYF FIQKALAGFP PFILGAFRFL AAGLLLITWC 
SIKGEKIFDK KSIRQAAIAG ILMLGIGNGL VIWVEQSIPS GLVAILVSSA AMWFVILDKP
KWKENLQSTS TVMGLIIGFI GVILLFAEQV MHTLNNNQSN TQIVGIVLLL LAPIGWAAGS
LYSKYNTTTT VSVSVNTSWQ MLAAGIAFMP GILFNSELKD FDWHMVSADA WLSVGYLVIF
GSIAAFSAYV WLLSVRPATQ VSTYAYVNPV VAVLLSILFT SEKVTIIQVA GLVVILGSVL
LINLAKYRKE HQLKQKTAYS K