Gene Phep_1380 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1380 
Symbol 
ID8252480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1639270 
End bp1640958 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content43% 
IMG OID644935033 
ProductSSS sodium solute transporter superfamily 
Protein accessionYP_003091656 
Protein GI255531284 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.366294 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCTG CAGACTGGTC CGTATTGATC TTAACGCTAG TTGTCATAGT AGTTTATGGC 
ATTTACAAGA GCCGAGGAGC GCAAAATATT CAGGGTTACC TGCTGGGCAA TCAATCGCTG
CCCTGGTACC ATGTATGCCT TTCTGTAATG GCTACCCAGG CCAGTGCCAT CACTTTTTTA
TCAGCTCCCG GCCTGGCCTA TTCTTCAGGA ATGAGCTTTG TACAGTTTTA TTTCGGACTG
CCATTGGCGA TGATCGTCCT GTGCATCACC TTTGTGCCCA TATTTCACCG CCTAAAGGTA
TACACCGCTT ATGAATATCT GGAACAAAGG TTTGACCTGA AAACCAGGGC ATTAACGGCC
TTTCTTTTTT TAATTCAACG GGGTCTTTCA ACTGGGATTA CCATTTATGC CCCTTCGATT
ATTTTATCAA CTATACTGAA TATCAATACC ACTTATACTA CTTTGTTTAT TGGTAGCCTG
GTCATATTTT ATACCGTCTA TGGTGGTACT AAAGCCGTTT CTTATACACA GATGCTACAA
ATGAGCATCA TCTTTTGCGG ACTGTTTGCC GCAGGCATTA TGGTGGTACA CCTGCTTCCC
GGCGACATCG GTTTCAGCCG GGCCATTAGC ATAGCCGGAA AGATGGGGCG CACAAATGCC
ATAGATTTTA AATTGGACCT GAACAACCAA TATACGGTAT GGACCGGCTT GATAGGTGGT
TTCTTTTTGC AGCTTTCTTA TTTTGGTACC GACCAGAGCC AGGTAGGCAG GTATTTATCG
GGCGCCTCTG TTAGCCAGAG CAGGCTGGGC TTGTTGATGA ACGGCCTGGT AAAAATACCG
ATGCAGTTCC TGATCCTGCT GATCGGGGTA TTGGTATTCA CCTTTTATCA GTATAACCGT
CCACCCATCT TCTTCAACAG TTTTGAACTG AATAAGCTGG AAAAGAGCAG CTATGCGCCT
GAACTTGATC AAATAAAGGT AAACTATAAC CGTGCTTTTG AAGAAAAACA GCAGGAAGTA
AATCAAATGA ATGCAGCACT TGATGCAAAT GATAAAGCAC GCATTGATAC ACAGAGAAAG
GCTCTACAGG CAGCCGATGA GAAAGAAAAG GCCATCAGAA AACAGGTTAC TGATTTGATG
GTAAAGAATG ATGAGCATGC CAATATAAAA GACAATAATT ATATCTTTTT GAGTTTTGTA
ACGCAATATT TGCCAAAAGG GTTAATAGGT TTGCTGATTG CCATTATTTT CCTGGCCTCT
ATGGGCTCCA CAGCAAGTGC TTTAAATTCA CTGGCTTCTA CAACAGTAAT AGACATTTAT
AAGCGGCTGA TCAAAAAAGA TGGTTCAGAT CACCAATATC TGCAGGCATC GCGGCTGGCG
ACAGTTTTTT GGGGGGTAGT TTGTATCATA ATGGCTTTAT ATGCCAGTAA AATTGGCAAT
TTACTGGAAG CTGTAAATAT ATTAGGCTCT TATATTTATG GTACTATACT GGGTGTTTTC
CTGGTCGCTT TTTATGTAAA GCAAGTAAAC GGGAGGGCTG TGTTCATTGC AGCTTTGCTA
ACCGAAGCTA TTATCGTGCT GCTGGGTAGT CGGGATGTTG TTGCATACCT GTGGTTAAAC
GTGATTGGCT GTGTGCTGGT GGTGTTGATA TCCCTGCTGG TTCAACAAGT CATGCGTAAA
GAAAAATAG
 
Protein sequence
MSAADWSVLI LTLVVIVVYG IYKSRGAQNI QGYLLGNQSL PWYHVCLSVM ATQASAITFL 
SAPGLAYSSG MSFVQFYFGL PLAMIVLCIT FVPIFHRLKV YTAYEYLEQR FDLKTRALTA
FLFLIQRGLS TGITIYAPSI ILSTILNINT TYTTLFIGSL VIFYTVYGGT KAVSYTQMLQ
MSIIFCGLFA AGIMVVHLLP GDIGFSRAIS IAGKMGRTNA IDFKLDLNNQ YTVWTGLIGG
FFLQLSYFGT DQSQVGRYLS GASVSQSRLG LLMNGLVKIP MQFLILLIGV LVFTFYQYNR
PPIFFNSFEL NKLEKSSYAP ELDQIKVNYN RAFEEKQQEV NQMNAALDAN DKARIDTQRK
ALQAADEKEK AIRKQVTDLM VKNDEHANIK DNNYIFLSFV TQYLPKGLIG LLIAIIFLAS
MGSTASALNS LASTTVIDIY KRLIKKDGSD HQYLQASRLA TVFWGVVCII MALYASKIGN
LLEAVNILGS YIYGTILGVF LVAFYVKQVN GRAVFIAALL TEAIIVLLGS RDVVAYLWLN
VIGCVLVVLI SLLVQQVMRK EK