Gene Phep_1622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1622 
Symbol 
ID8252724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1923501 
End bp1924928 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content43% 
IMG OID644935276 
Productsodium:dicarboxylate symporter 
Protein accessionYP_003091897 
Protein GI255531525 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.347157 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAAA ACAGGGTCGG GTTAATCACT TTGTTGTTTA TCACTGTTGC GGCATGTTTA 
GGCGTTTTAA AGGATTTCAA CATTGTTGAT GTACCGGACA ACGTCATGAT GGCCACACGC
TGGCTGCTGG CCCTCGAACT ATTGGCCTAC GCTTTGATCC GAAAAAACCT TACTACCTGG
ATATTGGTCT GTATGGTACT GGGGATATTC ATAGGGATAG ACTATCCGCA TGTTGCTGTA
GCACTTCAGC CCCTGAGTAA GGGCTTTATT AAACTGGTTA AGACCATTGT AGGACCTATC
TTATTTGGTA CCCTGGTTTA TGGAATAGCC GGACATTCTG ATTTAAAGCA GGTAGGGCGG
ATGGCCTGGA AGTCTATGTT GTATTTTTAT TTTGCAACTA CCATAGCCTT GTTTATTGGT
CTGGCAGCAA TTAACCTTAC CCATGCGGGG GTTGGTGTAA ATATAGAAAA TATGCCCCAT
AACGATTTGC CAAAACCTGT GAAGGATGTA ACAGACGAAA GCATACTGAG TACTTTGCCG
GATGGGGTAC ACTGGCTGTA TAAGACGCTG GCCTTTTTCA GGAATATTTT TCCGGAGAAT
ATTGTGAAAT CGGTGTATGA TGCCCAGATT TTACAGATCG TTATTTTCTC GGTTATTTTT
GGTATTGGTT TGGCAATGGT GGATGAAAAG AAAAGAAAAC CTATGGTTGA TTTTTGCGAG
AGCTTATCTG AGACCATGTT TAAATTTACC AATGTGATCA TGTACTTTGC CCCTATAGGG
GTAGGTGCTG CAATGGCCTA TACGGTAGGG CACATGGGCG TAGATATCCT GAAACACCTG
TTCATGCTGG TAGCTACCTT GTATATGGCG CTCATTGCCT TTATTCTTAT TGTACTGCTG
CCCATCGCGC TTTATATCAA ACTGCCTATA CTTAAGTTTA TCAATGCCAT AAAGGAACCG
GTTTCTATTG CGTTTGCCAC CACAAGTTCC GATGCCGCAC TTCCCAAAGC GATGAGTGCC
ATGGAAAAAT TTGGTGTACC ACGTAAGATC GTATCTTTTG TAATCCCCAC CGGTTACAGC
TTTAACCTTG ACGGCACTAC ATTATACCTT TCGCTGGCCT CTATTTTTGT GGCACAGGCC
GCAGGGATGC ACCTGAGTTT TGGCGAACAG CTCCTGATTG TATTTACACT GATGATCACC
AGTAAGGGGG TGGCCGCCAT TCCAAGGGCA TCACTGATTA TTCTGATCGC TACTGCAGAC
CAGTTTGGCC TGCCAACCTT TATTATTGCT GCTATTTTAG GGATTGATGA GTTAATGGAT
ATGGGAAGGA CATCTTTAAA TGTAATCGGA AATTGTCTGG CTACAGTTGT GATAGCCAAG
TGGGAGGGAG AATACAACCC CGATAGTATT GAAATTAAGG AAGCATAA
 
Protein sequence
MKQNRVGLIT LLFITVAACL GVLKDFNIVD VPDNVMMATR WLLALELLAY ALIRKNLTTW 
ILVCMVLGIF IGIDYPHVAV ALQPLSKGFI KLVKTIVGPI LFGTLVYGIA GHSDLKQVGR
MAWKSMLYFY FATTIALFIG LAAINLTHAG VGVNIENMPH NDLPKPVKDV TDESILSTLP
DGVHWLYKTL AFFRNIFPEN IVKSVYDAQI LQIVIFSVIF GIGLAMVDEK KRKPMVDFCE
SLSETMFKFT NVIMYFAPIG VGAAMAYTVG HMGVDILKHL FMLVATLYMA LIAFILIVLL
PIALYIKLPI LKFINAIKEP VSIAFATTSS DAALPKAMSA MEKFGVPRKI VSFVIPTGYS
FNLDGTTLYL SLASIFVAQA AGMHLSFGEQ LLIVFTLMIT SKGVAAIPRA SLIILIATAD
QFGLPTFIIA AILGIDELMD MGRTSLNVIG NCLATVVIAK WEGEYNPDSI EIKEA