Gene Paes_2072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_2072 
Symbol 
ID6459937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp2255121 
End bp2257475 
Gene Length2355 bp 
Protein Length784 aa 
Translation table11 
GC content54% 
IMG OID642726056 
Producthypothetical protein 
Protein accessionYP_002016729 
Protein GI194334869 
COG category[S] Function unknown 
COG ID[COG5617] Predicted integral membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.404123 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGAACG CACTCCGTCC ATATTTTCTG GTTGCTTTCG TTTTTTGCGC GATGGTTGCC 
ACTCTCCTGT ATCAGGTCCT TTTTCTCGGT ATGGTGCCGT CATCGCCAGA CAGTACCGGA
CCGATGGCGA CGTCGATGGC GCTCGACGCC CTGCGCGAAT CTTCAGGCAT GTACCCACTG
TGGCAGCCAT GGTCGTTTTC CGGTATGCCG ACAGTCGAGG CATTTACCTA TCTGAACGGG
CTCTATTATC CCGGTATCGC GCTCAGCTTG TTTCACATTG ACGGTCTGCT CCTGCAACTC
CTGCATCTCG TGTTTGCCGC GATGGGGGGG TATGTGCTGC TTCGTTTTTT CAGACTGCGT
CATATGGCCG CTTTTCTTGG GGGCGCGGCC TTCATGCTCA ATCCCTATCT GGTGACGATG
TTTGTCTACG GGCACGGCAG TCAGCTGATG AGTGCAGCCT ATATGCCCTG GGTGTTCTGG
GCCGGCTTGA GGGTTCTCGA TAGTCGAAAA ATCTACGATA TCGCTTTGCT GGCACTTTTT
GCCGGCTTGC AGCTGCAGCG GGCTCATGTG CAAATTGCCT ATTATACCTG GATGTTTCTG
TGTCTGCTGA TAGTGATCAA TGTCGTTGTC AGATATACAA CTCTTCGTGA AACCGCCGGG
AAACTGGCGG CAGTATCTCT GTCGCTTGTG CTTGCATTGG CATTGGCGGC GGCAGTCTAC
ATGCCTGCCC TGGCCTATAC GCCGTTTTCG GTTCGCGGCG CTTCAGCTGG AGGCGGGGCA
GCATACGGAT ATGCGACGAT GTGGTCAATG CATCCGACCG AACTGCTGAC CTTTCTGGTT
CCCGGTTTTT TCGGTTTTGG CGGTATAGCG TACTGGGGAC ACATGCCGTT TACCGATTTT
CCCAATTATG CCGGTCTGAT TATTCTGCTG CTTGCTCTTG GCGGGGCATG GGCCGGACGC
CGCGAGCCGT TTGTCTGGTT TCTGGTTTCC TCGATGCTCG TTGCTCTGTT GCTCTCATTC
GGCAGTTTCT GGAGTCCGTT GTATGACCTG TTCTACCATT TTGCGCCGTT TTTCAGCAGG
TTTCGGGTGC CTTCGATGGT GCTGATCGTG GTCTCGCTCG ACCTTTCACT GCTTGCGGGT
TTCGGTTTGC ACGCTCTCGG CAAGGGTCTT GATAAGGGCG CTATCAGGAT CCTTAAAGGC
GGCTCGTTAG TGCTGGCTCT CTTTATTGTT TTTTTTCTTT TGTTCGAACC TTCCATCGAA
TCATGGTTCC GCAGCGCCTT TCCTCTTCCC AATGTTGAGG GGGTGCAGCT TGTTCGTCTT
ATCGAGGATG CCCGTTGGAA TCAGCTGAAA GGAAGCCTGC AGGGCGTTGT CCTCGGTTCT
GCCTTGTTTT GCGGGCTTCT CTGGCTTTCT ATTCGTCAGG TTTTCTCCGA CCGGGTAACA
CTGCTTTTCG TGGCGGCGTT AGCGCTTGGT GACATTCTGC TTGTCGATCG TCAGATTGTC
GATCCGTCAA GGGACTCGTT ACGTTCTTCG CAGCTTCAGG CGGAGGCTGT TCTTGACAAG
GTGTTCAGTG ATGGCGATGT GGCTGATTTT TTGAAAAACG AACCGGGCAT CTTCAGAATT
TATCCCGCTG GCGGGCTTTT TGGTGAGAAC CGTTTTGCGG CTGCCGGACT GGAGTCGGTC
GGAGGGTATC ATCCGGCCAA AATAGCGCGT TATGATGCAC TGCTGAAGCG AACAGCAAAT
CTTGCCGATA CTGGTGTACT TCGGATGCTC AATGTAGGTT ATGTTATTGC TCCCTCTCCT
CTTGATCATC CTGAGCTGGA GGGCGTTTAT GAAGGAATGC TTCGTCTTGT GCGGGGCAGG
CAGGATGTTT GGGTTTACCG CCTCCGCGAC CCTATGCCGA GAGCATGGTT TGCTCTGGGG
GCAACGGCAT CAGAGTCTGC GGAACAGAGT CTTTCGGGTA TGCTGCAAAG CTCGAGCGGT
CCGGCTGAGA TGGTGTTTGT CGAGGATGGC GGATGGGAAG GGCAGAGATC GTTCGCCCGC
GGAGAGGTGC TTGCAATCGA TAGAGGTCCG GAACGGTTAT CGATGAACGT CAGTTCGGAA
GGAGATGCAC TCCTTGTTGT AAGCGAGGTT TTTTACCCGC AGGGCTGGAA GGCTTCTATG
GACGGCTCTC CCGTCAGAGT CCACCCTGTC AACGGGGTTA TCCGGGGAGT GCTTGTTCCT
GAGGGTGAGC ATCACATCGT TTTCAGTTAT GACCGTACGC TTTTTGAAAA CGGGCGACGC
TATAGCCTTG CGGCAGCCTT GCTGATAGTG ATGCTCTTTG CCGGCGGGAC GCTACTGCGA
CGCAAGGCAT CGTAA
 
Protein sequence
MKNALRPYFL VAFVFCAMVA TLLYQVLFLG MVPSSPDSTG PMATSMALDA LRESSGMYPL 
WQPWSFSGMP TVEAFTYLNG LYYPGIALSL FHIDGLLLQL LHLVFAAMGG YVLLRFFRLR
HMAAFLGGAA FMLNPYLVTM FVYGHGSQLM SAAYMPWVFW AGLRVLDSRK IYDIALLALF
AGLQLQRAHV QIAYYTWMFL CLLIVINVVV RYTTLRETAG KLAAVSLSLV LALALAAAVY
MPALAYTPFS VRGASAGGGA AYGYATMWSM HPTELLTFLV PGFFGFGGIA YWGHMPFTDF
PNYAGLIILL LALGGAWAGR REPFVWFLVS SMLVALLLSF GSFWSPLYDL FYHFAPFFSR
FRVPSMVLIV VSLDLSLLAG FGLHALGKGL DKGAIRILKG GSLVLALFIV FFLLFEPSIE
SWFRSAFPLP NVEGVQLVRL IEDARWNQLK GSLQGVVLGS ALFCGLLWLS IRQVFSDRVT
LLFVAALALG DILLVDRQIV DPSRDSLRSS QLQAEAVLDK VFSDGDVADF LKNEPGIFRI
YPAGGLFGEN RFAAAGLESV GGYHPAKIAR YDALLKRTAN LADTGVLRML NVGYVIAPSP
LDHPELEGVY EGMLRLVRGR QDVWVYRLRD PMPRAWFALG ATASESAEQS LSGMLQSSSG
PAEMVFVEDG GWEGQRSFAR GEVLAIDRGP ERLSMNVSSE GDALLVVSEV FYPQGWKASM
DGSPVRVHPV NGVIRGVLVP EGEHHIVFSY DRTLFENGRR YSLAAALLIV MLFAGGTLLR
RKAS