Gene Paes_1472 details

Gene Information

Locus tag: Paes_1472
Symbol:
ID: 6460434
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 1612085
End bp: 1613344
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 48%
IMG OID: 642725461
Product: major facilitator superfamily MFS_1
Protein accession: YP_002016139
Protein GI: 194334279
COG category: [R] General function prediction only
COG ID: [COG2270] Permeases of the major facilitator superfamily
TIGRFAM ID:
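
The coordinates and lengths in this record are mutually consistent, which a short calculation can confirm. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for Paes_1472.
length_bp = gene_length(1612085, 1613344)
print(length_bp)                  # 1260
print(protein_length(length_bp))  # 419
```

Both results agree with the Gene Length (1260 bp) and Protein Length (419 aa) fields.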


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.905727
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.384031
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCATCGA AGCGAACCAT AGTATCCTGG CTGATGTTTG ATTTTGCCAA CACATCATTC 
AGTGTCATGA TGGTGACGTT TGCATTTCCT CTCTATTTTA AAAATATCAT CTGCGGCGCA
GAAGCCTGGG GCGATGCCAT GTGGGGCGTC AGCGTCAGTG TTTCGATGTT TGTTGTAGCC
ATTATCTCCC CGTTTCTCGG TGCGGCCGCC GATATGTCCG GCAGGCGGAA ACGTTTTCTT
CTCATTTTCA CACTGATGGC AGTTCTTGCA ACCGTGCTGC TTGGCTTTAC CGGGCCGGGT
ATGGCCGTTT TTGCAGCCCT GCTTTTTATT GTCGCCAATG TCGGTTTTGA GGGCGGACTT
GTCTTTTATG ACGCATATTT GCCTGAAATC GCTTCTGAAC GAAGCATCGG GAGACTCTCC
GGGTATGGTT TTGCCATGGG GTATTTCGGT GCGCTGACTA TTTTGCTGCT GCTTTTTCCC
CTTCTTAAAG GGGGTATTGT TCTGGAAAAC AGCCAGAATA TCAGAAAGAG CTTCTTTGTC
GTTGCCCTGT TTTTTGCTCT TTTTTCAGCA CCGCTTTTTC TTGCCCTCAG GGATAGAAAA
AAAACGGAGC TTCCCGGCAG AACGTTTATG AGCTCAATTC GCGAGGTACG CTATACCATC
ATGCATATCA TGAATTATCC TGACCTTGCG CGTTTTTTGC TCGCATTTTT CTTTTACAAC
GACGCCATTC TGACGGTTAT CGCATTTTCT TCGATCTATG CACAAAATAC GCTGGGTTTT
ACAACCTCAG AGCTGATCGT CTTTTTTATG ATCGTTCAGA CGACAGCAAT CATCGGCTCG
ATCGTCTTCG GCATAATCAC CGATCGAATA GGCCCCAAAA GGACGATTGT ACTGACACTC
TTTATATGGT GCGCCGTTAT CGTTATGGCG ATCATGACTC GTGAAAAAAC TTTTTTTTAC
TATACCGGTC TGCTTGCAGG TATGTCGATG GGTTCCTCTC AGGCAGCGTC TCGTTCCATG
ATGGCCAGGC TGACGCCTAA GGAACATGTG ACAGAGTTTT TCGGTTTCTA TGATGGCACC
TTCGGCAAGG CTTCAGCAAT TCTCGGCCCG GTGATTTTCG GTGTTGTTTC CGTCCAGGCA
GGTGATCAGC GATATGCTCT CGCATCGCTC CTGTTCTTTT TTATTCTCGG TCTGGTCTGT
ATTCTGCCTG TTCGTTCCAG CAGCACAGCT CTTCGGGCCG TTTCCCGGTC GGATGCATGA
 
Protein sequence
MSSKRTIVSW LMFDFANTSF SVMMVTFAFP LYFKNIICGA EAWGDAMWGV SVSVSMFVVA 
IISPFLGAAA DMSGRRKRFL LIFTLMAVLA TVLLGFTGPG MAVFAALLFI VANVGFEGGL
VFYDAYLPEI ASERSIGRLS GYGFAMGYFG ALTILLLLFP LLKGGIVLEN SQNIRKSFFV
VALFFALFSA PLFLALRDRK KTELPGRTFM SSIREVRYTI MHIMNYPDLA RFLLAFFFYN
DAILTVIAFS SIYAQNTLGF TTSELIVFFM IVQTTAIIGS IVFGIITDRI GPKRTIVLTL
FIWCAVIVMA IMTREKTFFY YTGLLAGMSM GSSQAASRSM MARLTPKEHV TEFFGFYDGT
FGKASAILGP VIFGVVSVQA GDQRYALASL LFFFILGLVC ILPVRSSSTA LRAVSRSDA
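
The protein sequence can be checked against the gene sequence by translating it. The following is a minimal sketch, not the database's own pipeline: the helper names are hypothetical, the codon table is built from the standard amino-acid assignments (which translation table 11 shares with the standard code), and the first codon is assumed to be an initiator and is therefore read as M, since GTG is a permitted start codon in table 11. Whitespace is stripped first because the sequences are printed in blocks of ten characters.

```python
# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS; the first codon is read as M (assumed initiator), stop ends translation."""
    dna = clean(dna)
    protein = []
    for index in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[index:index + 3]]
        if residue == "*":
            break
        protein.append("M" if index == 0 else residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First six codons of the gene sequence above.
print(translate_cds("GTGTCATCGA AGCGAACC"))  # MSSKRT
```

The first six codons translate to MSSKRT, matching the start of the listed protein sequence. Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the complete protein and the GC content field to be compared in the same way.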