Gene Paes_1467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1467 
Symbol 
ID6460559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp1604913 
End bp1606214 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content51% 
IMG OID642725456 
ProductHipA domain protein 
Protein accessionYP_002016134 
Protein GI194334274 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00227623 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATGACG ATGCAAGGGT GAGGCTTTGG GGCAGTGTTA TCGGTGCTGT CAGCTGGCTT 
GATGAAGGAG ATGTCGGTGT TTTCCAGTAT GCGCCCCAAT TCCTGCAAAG CGGTATTCAG
GTGGCACCTT TCATGATGCC TCTTGATGAG TTTCCTTATG AGTTTTCAGC GCTTGCCAGA
AACACATTCA AAGGCTTGCC CGGTCTTCTG GCCGACTCGC TTCCTGACAG GTTCGGCGAT
ACGGTCATTG ATGCATGGCT TATGTCGCAG GGATATGAGC CATCGGAGAT TCATGCTGTC
CAGCGGCTCT GCTTTATCGG GAGTCGAGCA ATGGGGGCCC TTGAGTTCGA GCCTGTTATT
CCCGGTACTC TGGCAGGTTC AAAGCGCGTT GATATTGCTC GCCTTGTCGC CGCGACAAAC
CTTATCCTTG AAGAGCGTCG TGCTCCACAA CGCGTTTTCA GGCGGCATGA TACGCGAGAG
ATGATCGATG CACTCTTTTG CGTTGCTGCA TCTGCCGGAG GCCGAAGAGC GAAAGCACTG
CTCGCATGGA ATCCTGTTAC AGACGAATTC AGATCCGGTC CTGCCGATGT TGGGGCAGGT
TTTGAGGATT GGATGATAAA ATTCGATGGA GTTACCAGTA GCCTCGACAG AGAAACCTCC
GATCCCGCAG GTTCAGGAAG CATCGAATAC GCTTTCCATC TGATGGCGGT TGAAGCCGGT
ATTGTCATGA TGCCATGCAG ACTGCATCAG GAAGGCGGCC GGAGCCATTT TATGACAAAG
CGCTTTGATC GGACAGCCAA CGGCGGAAAA TATCACATGC AGTCTCTGGG GGCAATGACG
CACATTGATT ACCATCAGCC TCTGAGTTAT TCCTATGAAC AGGCAATCCA GCTGATGAGA
CGCCTTGGAC TCAAGCGCGA AGATCTGGAG CAGCTTGTCC TTCGTGCAAT GTTCAATGTC
ATTGCCTGCA ATCATGACGA TCATGTCAAC AATAGTGCTT TTCTCATGAA CCGCCGCGGC
CAATGGCGAT TGTCGCCGGC GTTCGATCTT TCCTATACAT CTGATCCCAG GAGGGTGTGG
ACAATGCTTC ATCGGATGAG CATTAATGGA AAATGCGAAG ATTTCAAACG CGAGGACCTG
ATCGCTCTTG CCAGCGTCGC AGGAATCAAA AAGACAAGGG CTAACGAGAT GATCTATCGT
GTGACCGAAA CTGTTCGGCG CTGGCCATAT TTTGCAGAAA AAGCCGGTGT GACGGAAAAA
CGCATGGCTC ACATTCAGGC TGACCAGGAT ACCTTGTTAT GA
 
Protein sequence
MYDDARVRLW GSVIGAVSWL DEGDVGVFQY APQFLQSGIQ VAPFMMPLDE FPYEFSALAR 
NTFKGLPGLL ADSLPDRFGD TVIDAWLMSQ GYEPSEIHAV QRLCFIGSRA MGALEFEPVI
PGTLAGSKRV DIARLVAATN LILEERRAPQ RVFRRHDTRE MIDALFCVAA SAGGRRAKAL
LAWNPVTDEF RSGPADVGAG FEDWMIKFDG VTSSLDRETS DPAGSGSIEY AFHLMAVEAG
IVMMPCRLHQ EGGRSHFMTK RFDRTANGGK YHMQSLGAMT HIDYHQPLSY SYEQAIQLMR
RLGLKREDLE QLVLRAMFNV IACNHDDHVN NSAFLMNRRG QWRLSPAFDL SYTSDPRRVW
TMLHRMSING KCEDFKREDL IALASVAGIK KTRANEMIYR VTETVRRWPY FAEKAGVTEK
RMAHIQADQD TLL