Gene Phep_2054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2054 
Symbol 
ID8253158 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2370848 
End bp2372116 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content42% 
IMG OID644935702 
Productnucleoside transporter 
Protein accessionYP_003092321 
Protein GI255531949 
COG category 
COG ID 
TIGRFAM ID[TIGR00889] nucleoside transporter 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0184086 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAATG TTAAGTTTCG CTTAACCCTG ATGAATTTTA TGCAGTTCTT TATCTGGGGT 
TCATGGCTGA TTACTATTGG CGTATACTGG TTTCAAAATA AGAAATGGTC GGGCTCAGAG
TTTGGTGCTA TTTTTTCTAC CATGGGCATC TCCGCTATTT TTATGCCTGC ACTAACCGGC
ATTATATCAG ACCGCTTTAT CAATGCCGAG AAACTATACG GACTGATGCA CATTTGTGGT
GCAGTTGTAT TGTTCTGTTT ACCAATGGTA GATAACCCCC ATACCTTTTT CTGGGTAATA
CTGTTAAACA TGATCTTTTA CATGCCTACC CTATCCCTTT CTATTACAGT AGCTTATTCT
GCTTTAAAAA GCAACGGTAA AGATGTAGTA AAGGATTACC CACCCATCAG GATCTGGGGA
ACGATAGGTT TTATAGCTGC CTTATGGGTA GTCAGCATTT CCGGCAGCGA AGCCACATCC
AACCAGTTTT ACATTGCTTC GGCAGTTTCC CTGCTGCTGG GACTTTATGC CTTTACCTTA
CCTAAATGTC CACCATTGGC CAATAAGGTA GATTCAAAAT CATTTGTTGA TGCCCTGGGT
TTAAGGGCCT TTGCCTTGTT TAAGCAAAAG AAATTTGCTG TATTTTTTCT TTTCTCCATG
TTCCTGGGTG CCGCCCTGCA GCTCACCAAT GCGTATGGCG ATACCTTTTT ACACGATTTT
AAGAATGTAC CCGAGTTTCA GGACCTGCTG GCTGTAAAGT ATCCGGCCAT CATTATGTCC
ATTTCTCAGA TTTCGGAAAC CCTGTTTATC CTGGCCATCC CCTTTTTCTT AAGAAAGTTT
GGAATAAAAT ACGTAATGCT GTTCAGCATG CTGGCCTGGG TATTGCGGTT TGGTCTGTTT
GCTTACGGCG ATCCTGCCGG TGGCCTATGG ATGATCATTT TGTCCTGTAT CATTTACGGG
ATGGCCTTTG ATTTTTTCAA TATCTCCGGA TCGTTATTTG TAGAAACACA GATCGATGCA
AAAATACGTG GCAGTGCTCA GGGCCTGTTC ATGATGATGG TAAACGGCTT TGGCGCACTT
TTCGGCAGCT TTACCAGTGG CTTTATCATT GATAAGTTTT TTACACATAC CGATCAGAGT
AAGGACTGGC ACAGCATCTG GATCACATTT GCATCCTATA CCCTATTGCT TGCCATTGTA
TTTCCATTTG TATTCAAATA CAAACACAAT AAGGCAGAAG AAAAGGCCAT AGAAGCAATG
CGCCATTAA
 
Protein sequence
MMNVKFRLTL MNFMQFFIWG SWLITIGVYW FQNKKWSGSE FGAIFSTMGI SAIFMPALTG 
IISDRFINAE KLYGLMHICG AVVLFCLPMV DNPHTFFWVI LLNMIFYMPT LSLSITVAYS
ALKSNGKDVV KDYPPIRIWG TIGFIAALWV VSISGSEATS NQFYIASAVS LLLGLYAFTL
PKCPPLANKV DSKSFVDALG LRAFALFKQK KFAVFFLFSM FLGAALQLTN AYGDTFLHDF
KNVPEFQDLL AVKYPAIIMS ISQISETLFI LAIPFFLRKF GIKYVMLFSM LAWVLRFGLF
AYGDPAGGLW MIILSCIIYG MAFDFFNISG SLFVETQIDA KIRGSAQGLF MMMVNGFGAL
FGSFTSGFII DKFFTHTDQS KDWHSIWITF ASYTLLLAIV FPFVFKYKHN KAEEKAIEAM
RH