Gene Paes_2233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_2233 
Symbol 
ID6459673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp2413592 
End bp2414788 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content55% 
IMG OID642726209 
Productprotein of unknown function DUF323 
Protein accessionYP_002016882 
Protein GI194335022 
COG category[S] Function unknown 
COG ID[COG1262] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00201731 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATA TTTTTATCAG TTACGCCCGC GAGGATGAGC GTCGGGTTAA GCCTATCGTG 
CAGCATCTCC AGCAACTGGG TTGGCGAGTA TTCTGGGATC GGAATATTCC GCCGGGCCAG
AGCTGGGATG AATACATAGA ACAGCATCTG GAGGCTTCAC GCTGTGTGGT GGTCGTCTGG
TCGAAGCACT CTGTGGGCTC CCGATGGGTG AAGGCAGAAG CCGAAGAGGC AAAAAACAGG
AATATTCTTG TGCCCCTGTT ACTTGACAAG GTAAAACTCA GTCTGGGATT CAGGTACATC
CAGGCAGCCG ACCTGACCTC ATGGAACCAT GATGATAGAA CGCATCCCCA ATACAGGGCA
TGTATCGATG CTATTGCCCG TATGATCCCG CAGTCAGGGC CGTTAGAGCC GGCTTCTCCG
GATGTGTTCG TTACTCCGGT TGAGCCTCCT GCTTCCAGGC CGCAGCTTCC GGAAAACTTC
GTCCTGATAA AGGGTGGGCA GTTCAGCATG GGCAGCCCTG AAGACGAACA TGGTCATGAG
TCTGATGAAA CCCTGCACGA AGTGAAGGTG AGCGACTTTG CTCTCTGCAG GTATGCGGTG
ACGGTCGGAG AGTATCTTGA ATTTACCGAA GAGGCCAAGA TCAATTATGA CGCTGGCACG
GAGGGCGATC GTTATCCTGT GGTCAATGTT TCGTGGAATG ATGCAGTTGC CTATTGCCGG
TGGTTATCCG AAAAGCGGGG CGAGCTTTTC CGGTTACCGA CAGAGGCAGA GTGGGAGTAT
GCGTGTCGCG GTGGGACCAC AACTCCTTTC AGCACAGGAG AGAACCTGAC CACCGATGAG
GCAAACTATG ACGGGAACTA CCCATACCGT AATAATTCTA AGGGGAAGTA TCGCGAAGCA
ACGGTGCCGG TGGACAGCTT CGAACCCAAC AGTTACGGAT TGTACAACAT GCACGGCAAT
GTATGGGAGT GGTGCGGTGA CTGGTATGGG GAGAAATACT ATGAAGAGTG CCGGAAGAAA
GGTGTGGTGG AGAATCCGCA GGGGCCGAAA GAGGGTTCGC GCCGTGTTCT TCGTGGTGGT
GGCTGGACCT ACTATGCGCG GTACTGCCGG TCGGCTGATC GCAGCAGCGG CGGCCCCGGC
TATCGCGGCA ACAATGTGGG CTTCCGCCTT GTGTTCGTCC CGCAGTTCAA AGGGTAG
 
Protein sequence
MSDIFISYAR EDERRVKPIV QHLQQLGWRV FWDRNIPPGQ SWDEYIEQHL EASRCVVVVW 
SKHSVGSRWV KAEAEEAKNR NILVPLLLDK VKLSLGFRYI QAADLTSWNH DDRTHPQYRA
CIDAIARMIP QSGPLEPASP DVFVTPVEPP ASRPQLPENF VLIKGGQFSM GSPEDEHGHE
SDETLHEVKV SDFALCRYAV TVGEYLEFTE EAKINYDAGT EGDRYPVVNV SWNDAVAYCR
WLSEKRGELF RLPTEAEWEY ACRGGTTTPF STGENLTTDE ANYDGNYPYR NNSKGKYREA
TVPVDSFEPN SYGLYNMHGN VWEWCGDWYG EKYYEECRKK GVVENPQGPK EGSRRVLRGG
GWTYYARYCR SADRSSGGPG YRGNNVGFRL VFVPQFKG