Gene Paes_1233 details


Gene Information

Locus tag: Paes_1233
Symbol:
ID: 6459490
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 1340537
End bp: 1341781
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 49%
IMG OID: 642725221
Product: hypothetical protein
Protein accession: YP_002015906
Protein GI: 194334046
COG category:
COG ID:
TIGRFAM ID:
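
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count, assuming one terminal stop codon is not translated."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1340537, 1341781)
print(length)                     # 1245
print(protein_length_aa(length))  # 414
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with the inclusive-coordinate assumption.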


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0143332
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.037479
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATGA AGAAACGATT TTTCCCACTT ACTGTTGCCG CATTTGCAGC AATCGGCAGT 
TCTTTACTCC CGGCAACTTT CCTCAACGCC GGATCAAGTC CACCCGGCAA TACAGGGCAG
GAAGAGAGTC TTACTCAAAA GGCGATGGAA GAGTATGGTC GGCACTATGC CAATCTCATG
CTGGTCGAGC AACACCGCCC TTACGATGAG AAGCTGGTGC AGATCGCTCT CCTGCTGGAT
ACAAGCAACA GTATGGACGG GCTGATCAAC CAGGCTAAAA GCCAGTTGTG GCGGATTGTC
AATGAACTTT CCAGGGCGCA TAAGCGGGGG AACGATATAC GACTTGAAGT TGCACTTTAT
GAATACGGCA ATGATCGCCT TGCGATGACG GCCGGCTATA TTCGTCAGGT GACGCCGTTT
ACCGAAGATC TTGACTGGTT GTCCGAGGCG CTTTTTTCCC TTCAGACAAA CGGTGGTTCC
GAGTATTGCG GTCATGTTAT CGGGAGCAGT CTTAACCAGT TGGGATGGAA CCGGTCGGGA
GATGGACTGA AGATGATATT TATTGCAGGT AATGAGCCTT TTAATCAGGG TTCGGTGAAC
TACGAGGTTT CCTGCCGCTG GGCTGTCGAG AGAGAGATTG TTGTGAATAC CATCTATTGC
GGGCCCTATC AGAGAGGCAT TGACACACTC TGGCAGCAAG GTGCCAATAA AGGCGGAGGC
AGTTATTTCG CCATAGACAG TGACAAAGTC CTGAAGGGGA TCGTAACGCC TTATGATGAT
GATCTTCTGA AGCTGAACAG CGCAATCAAT AGTACCTATA TCCCGTATGG AAGCAAGGGT
GAGCAGAACC TTTCCCGTCA GGCTGAGCAG GATATGAATG CCTCAAAGCT TTCCCCATCC
ATTTCTGCGG CAAGAGCCGC TTCAAAAGGT TCGAAACTCT ACAAGGCGTC AGACTGGGAT
CTGGTTGACG CCCTTGAAGA AAAGAAAATA TCGATTGAAA ATATAAGCAG GGATGCTCTG
CCGAAAGAGC TGCAGGAGAT GAGGCCTGAA AATCTTGGCC AGTTCGTTCA GCAGAAAAAA
GAGGAGCGTG AAGAGATCAG GCAGAAGATT GCAGCATTAA GCCGTAAAAG GGATGACTAC
ATCCAGAAGA AAGAACATGA ATCGGCAGGG GAGCAGACAC TTGGTTCCGC TATTCTCAAG
ACACTCCATA CTCAAGCAGA AGCGAAAAAC TTCAGGTTCG AGTAG
 
Protein sequence
MNMKKRFFPL TVAAFAAIGS SLLPATFLNA GSSPPGNTGQ EESLTQKAME EYGRHYANLM 
LVEQHRPYDE KLVQIALLLD TSNSMDGLIN QAKSQLWRIV NELSRAHKRG NDIRLEVALY
EYGNDRLAMT AGYIRQVTPF TEDLDWLSEA LFSLQTNGGS EYCGHVIGSS LNQLGWNRSG
DGLKMIFIAG NEPFNQGSVN YEVSCRWAVE REIVVNTIYC GPYQRGIDTL WQQGANKGGG
SYFAIDSDKV LKGIVTPYDD DLLKLNSAIN STYIPYGSKG EQNLSRQAEQ DMNASKLSPS
ISAARAASKG SKLYKASDWD LVDALEEKKI SIENISRDAL PKELQEMRPE NLGQFVQQKK
EEREEIRQKI AALSRKRDDY IQKKEHESAG EQTLGSAILK TLHTQAEAKN FRFE
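
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any computation. The following is a minimal sketch, not the database's own method, of how the GC content and the translation could be recomputed from the listed gene sequence. It uses only the Python standard library; the codon table is the amino-acid assignment of NCBI translation table 11, and the helper names (`clean`, `gc_content`, `translate`) are hypothetical.

```python
# Amino-acid assignments for NCBI translation table 11, codons ordered
# with bases T, C, A, G at each position ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Simplification: alternative start codons (e.g. GTG, TTG) are not
    converted to M, and ambiguous bases such as N raise a KeyError.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence above.
print(translate("ATGAACATGA AGAAACGATT"))  # MNMKKR
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the listed protein sequence and the GC content field.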