Gene Paes_2101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_2101 
Symbol 
ID6458397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp2281176 
End bp2283197 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content32% 
IMG OID642726083 
Producthypothetical protein 
Protein accessionYP_002016756 
Protein GI194334896 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGGGG AAATGATAAG AGAGCCTAAA GGTGAAGTTA TTGTGATATT CGTACACGGA 
TTGCTTGGAG GTGTTGGTGA TTCATGGACA AATTCAAATG GTACATATTG GCCGGAATTG
CTGGAAAATG AATCCGATTT AGAATCGGTG GGTATATATG TGTTTACATA TCATTCTGAT
TTTTCTAGTG GAGACTATGG ACTTAGTGAT ATAGTGGATG CATTACGAGA GCAGATGAAG
ACAGATGATA TTATTAAGAA TAAAAAATTA CTATTTATTT GTCACAGTAT GGGCGGCATA
GTAGTGAGAA AATATTTGGT TGAAAATGCA AGTAAGCTAA TTGAATGTGA TTGCGAAATT
GGTTTATACC TTTTGGCATC TCCATCATTA GGTTCTTTAT ATGCTAATTG GTTAGAGCCT
ATAATAAATA TTTTCAATCA TAGGCAAGCT TCAATGCTTA AATTTATAAG AAGAAATCCA
CATCTCTCAG ATTTAGATAG AGAATTTATG AATCTTAAAG AATTAAGTGA TTTAAATATT
GTGGGTAAAG AAATTATTGA AGATAAATTC ATATTTTTTG ATAAATTTTT TAAAAGACAG
ATTGTTGAGC CATTTTCTGG TGGCAGGTAC TTTGGTGAAC CTTTTAAGGT TCCAATGTCT
GATCATTTTT CAATAGCACA ACCTGAAAAC AAGCATGCAA TTCAGCACAG ATTGTTATGT
CAATTTATTT CAGATTTTAC ATTTAATAAA TATTTATTTC CACATTCGAT AAAAAGAATT
TTTTCGAGGG CGGATAGTCA CGACTTCCAG GAAAACATTG AATCTCTTAT AAGAGGTTGT
GGTCATTTTG TGCTTATTGG GACAGGTTTG ACAATATTAC AGAAAGACCC TTTTGCTTAT
GAAGTGTTTG AGAGGGCAAA GAATAATGAA TTTAAAATTG AGATATATCT TGCTGATCCA
CATAGTCCAG ATGTACAATG TCGTCTTATA GAAGAGGAAT TAGGCACTTT GAAGCCGCCT
GTTGGAAAGT CGGGTTTGAC TAAAAGACTT GATACGTTAT ATGGGTTATG GAAGGATTTT
GATTTTAGCG ATAATATTTC TATTAACGTT TTTCGTAATT ATCCAACATT TGCTCTTATT
ATTATAGATG ATAATTATTT TATATACCCA TACGGGTTTG CTAAGTTGGG TAATTTTAGT
CCCGTTATGT CTTTTTTGAA AACTGGAAAC ACAGATGATT CTATGATTAG ATTTTTGGAT
GATCAATATG TTTCAATTAA AAATAGTTCG TGTGATTTAC GCAAGATAAG GTCCAGGGGG
AATGATGATG CTGAAATAGT GAAAGATTTG TATTCTTTTG CTTTATATAT TGTGCCACCT
AAAGATAGTG ATTTGTATGT TTTCGGTACA GATGTGCTAG GCTATGATGT AAGAGCTCGA
CTTAATAAGA AAAGTCAGTG GGAAGATTTT GTGGGGGACG CTTTTGAATA TGGATTTCAT
CTTACAATAT GCGATGCACT ATATTTTTAT AACGTAAGTG ATGTAAAATT AGCTGTTACA
GCTATAGAAT ATATATCGAA AGATTTTGTG CCTTTCGAAA TAAACAACTT ACGCATTAGA
GAAAGTTACC CTAGTCAAAA TTGTTTATCT GTTGTAGGTG ATGATGCAGG AGGTTCATTA
GAGGCTTTAC ATTTTGAGTT TGTTACAAAT GTCTATCGTC GCGCAGCAGA GTCAAATTAT
TCCTTAGGGA TGGCAGGGCC ACCTCGGGAT AAAAACATAC ATAGATCAAG ATTAATGATA
GAAAAATATA AAGCACCGTA TATCATAAAA AAATTCTGCC CTCACTTTAC GCTTTTAAAT
AAAATAAATA ATTCATCAAT GAAAGCTGTA AGTGAAAAAT TAAATGTTAT TTTTTTAAAT
TCTGTAAAAG ATACAACATT GAGAGTTGAT TCTTTGGCTT TAATGAAGAA AGATTACTAT
AAAGGTAAGT GGGTGATAGA AAAAGAAATA AGATTAGGTT GA
 
Protein sequence
MNGEMIREPK GEVIVIFVHG LLGGVGDSWT NSNGTYWPEL LENESDLESV GIYVFTYHSD 
FSSGDYGLSD IVDALREQMK TDDIIKNKKL LFICHSMGGI VVRKYLVENA SKLIECDCEI
GLYLLASPSL GSLYANWLEP IINIFNHRQA SMLKFIRRNP HLSDLDREFM NLKELSDLNI
VGKEIIEDKF IFFDKFFKRQ IVEPFSGGRY FGEPFKVPMS DHFSIAQPEN KHAIQHRLLC
QFISDFTFNK YLFPHSIKRI FSRADSHDFQ ENIESLIRGC GHFVLIGTGL TILQKDPFAY
EVFERAKNNE FKIEIYLADP HSPDVQCRLI EEELGTLKPP VGKSGLTKRL DTLYGLWKDF
DFSDNISINV FRNYPTFALI IIDDNYFIYP YGFAKLGNFS PVMSFLKTGN TDDSMIRFLD
DQYVSIKNSS CDLRKIRSRG NDDAEIVKDL YSFALYIVPP KDSDLYVFGT DVLGYDVRAR
LNKKSQWEDF VGDAFEYGFH LTICDALYFY NVSDVKLAVT AIEYISKDFV PFEINNLRIR
ESYPSQNCLS VVGDDAGGSL EALHFEFVTN VYRRAAESNY SLGMAGPPRD KNIHRSRLMI
EKYKAPYIIK KFCPHFTLLN KINNSSMKAV SEKLNVIFLN SVKDTTLRVD SLALMKKDYY
KGKWVIEKEI RLG