Gene Paes_1452 details

Gene Information

Locus tag: Paes_1452
Symbol:
ID: 6459155
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 1584504
End bp: 1587665
Gene Length: 3162 bp
Protein Length: 1053 aa
Translation table: 11
GC content: 48%
IMG OID: 642725439
Product: protein of unknown function DUF323
Protein accession: YP_002016119
Protein GI: 194334259
COG category: [S] Function unknown
COG ID: [COG1262] Uncharacterized conserved protein
TIGRFAM ID:
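
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1584504, 1587665)
print(length)                  # 3162
print(protein_length(length))  # 1053
```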


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGATA AGCCGCACAG AATCAGGATT CTTATTGCTT CTCCTTCTGA TGTTCAGGAA 
GAACGAAAGA GGGCCATAGA GGTAATCAGG CAATGGAATG CGTCACAGGA GAGTGTTTTT
CTCGAAGCTA TAGATTGGGA AACCTATGCT GCTCCGGAAG GAGACGGCGA GCCGCAGGAA
AAAATCAACG AACAGATTGT CGATCGTTGT GATTGCGCTG TAGGGATTTT CTGGACGCGA
ATTGGTACTG CAACAAAAGT TGCTCCGGGT GGGGCAGTTG AAGAAATCCA GCGCCTTGAA
GAGTCGGGGC GAAAAGTGAT GGTTTATTTT TCAAACCTTC TCATGTCAAG ACAGGCAGCC
AAAGATCCTC AATTTCAAAG AGTCGATAAA TACAGGGAAG AAAGGGAAAA GAAGTCACTG
TGTTGGAGCT ATGGTACTCA TGATGACTTT GAGAAATATC TTTATCATCA CCTCAATATC
CAGATCCCCC GCTGGTTTCC CGAATTATTT AAGTCGAAAA CTACAGTCAA AAGGCCTTCA
AAGACAGATT TTTCCGATCA GCAAGTCTGG CAAAAATACT GTAACAAGCT TCGCAACGAA
CTCAACACCA TTTCATTGCT CGGTTCATCG GTGATTCAGC CGTTTCCTCT CCAGTTGAAA
GATATATTTG TTCCTTTGGA AATGTACGGG GGATCGCATG AAAGCGAGGC GATGAAGCGG
TGTATGGTTG GTCCTGCGAA GGAAAACGTT TTGACGCACA GGCCTGAACA TGTGCTGAAA
GAGACCTATA AGAAGTACAA AACACTATTG GTCATCGGTG ATCCGGGTTC AGGGAAGACA
ACCCTGACAA AATACTATGC GCTTACCTGT CTCGAAGAAA ATCCTCCAGT AACCCTTGGG
TTTGAGGGCT CCGTCAGGGT GTTTTATCTG CCTCTGCGTG ATCTTGAACG AAATAAGAAA
GGGTATATAT CACTGCCATC GAATTTAGCT GCATGGTCAC GACGGAACAA TCTTGATATC
AAGGTCAGGA CCTTTCAGGA ATGGCTTGAT AGCGGAATCT CTCTGGTTCT GCTCGACGGG
CTTGATGAAG TCAGTGATCC TGAACAGAGA AAAGAAATCT GCCGTTGGAT AAAACAGGCG
CAAGGAACGT TCGACAAGGC GAGGTTTGTC GTGACATCGA GACCGACAGG CTATCGTCAG
GATGATGGAA TTACGCTTGA TTTCGACCAT CAGCGCGTGG CGGTCAAAGA CTTTTCTCAT
TCACAGCAGA TCCAGTTCCT CAATAACTGG TACCGGGCGG CATTGTTGCA CGAGATTCGC
CCTGAAGATC TCTCCGAAGG TGAATGGGAG GCGCAGCAGA GTGGTGAAGC AGAAAGGCTT
GCCACAGCGA TGATTGAATA TTTTAAAAAA CCTGAAAACA AAGGTGTACG AGAGCTTGCG
GCAGTTCCTA TGCTTTTACA GATCATGGCG ATCCTCTGGA AAGAGCGCAA GTTTCTTCCG
AACAGACGCC AGGATCTCTA TAGTGCGGCG CTTGATTACC TGCTTGAATA CCGTGACCGT
GTTCGAGATA GAGAGCCCTT ACTCAGTGCG GAGGATACCC GACGGGTACT TGCTCCTGTT
GCTCTCTGGA TGCAGGAAGT GCTCAGTTTT GACGAAGCTG ACCGGAAAGC AATGCACGAA
CAGATGCAGC AGAAGCTGGA TAAACTCGAA GGAAATCACA ACGCAAGGGA TATCTGTCGG
AATCTGGTTG ACCGTACAGG TGTGCTGGTT GAGCATGGAA AGAAAACCTA CATGTTTCGC
CATAAAACAT TTCGGGAATA TCTCGCTGGA GTTCAGCTCA AGGAAGTGTG GTTCGAGCCT
GATCGTATCA GGACGCTCGT TGAGCATTTT GGAGAGGAAT CCGGTTGGTG GGATGAAGTG
ATCAAGTTTT TCATGGCGCA GTCAAACGAG AAGATTTTCG ACCATTTCAT GCGTGAACTG
TTTGCGTCTT CTGCCAGTGT CGATTTTTCA CCGAAACAGA AAAAACTGCT TGCTCAGGTT
ATCGAAGAGT CACCTGAAAA ACGTGTTGAT GCTTTGTGTG AAGCATTGCT CAATGAGAAG
GAAATAAGCG CCTATCGACA ACGATCGATA CTCGATTCGC TCAAATCACT GAATCAGGCG
GCAGCACTCG ATAATTTGCA CCGATTCAAA AAGGACGGCA TAGCGCTGAA TCAGGACATC
AATGAGCTTG CGGAAGATGT CATCCGATCT CTCGAAAAAG TAGCAGGAAT AACTCGAGAT
GACAAAAAGT TAACGAAAGC CGATACAGCC ACACAGGAAA AGACTGGTGA GCCTGAGAGA
TTGATCCGCA ACCCCTACGA ACACGACGCG CAATACATCC TGATTCCCGG CGGGAAGTAC
CTCTATTCGG AGACAAAACG CGAAGTTACC GTCTCCGACC TTTATGTTGC CAAATACCCG
GTGACCAACA AGCAGTACCG CTCGTTCATT GATTTTCTCG CAGGGAAGCC TTCCTTGCAT
GACACCAAAC TCGCCTTGAA AACATATAAG GAGTCATTGC ATGACCTTGC CGGGAGCAGA
GACGATTCGC TCAAGGGATT TCAGGAGTAC CTGAAAGAAA AGCAGAAACT TGCAGGTCTG
TTCAGATCTA AATATGATGA TGACCGGAAG TTCAACAAGG ACGACCAGCC GGTTGTCGGG
GTGAGCTGGT ATGCGGCCCG AGCCTACTGC CTGTGGCTGA CGATGCTTGC AGGTGATAAG
GCAGAATATC GTCTGCCGAC TGAGGTTGAA TGGGAGTGGG CTGCCGGAGG CAGGCGGGAC
AAACCGGATG AGGTGTTGGA AGTTCGGTCA TATCCCTGGG GGGATACACC TGAACCAAGT
TCCAAGTATG CCAATTATGA TGCGACTGAA GGGGCGACAA CACCTGTAGG GCGCTATCCC
GACGGTGCGA CACCGGAAGG GCTGTACGAT ATGGCGGGTA ATGTATGGGA GTGGATGGAG
AACTGGTATG ATGATAACAC AAAGCGATCG AAAGCCTTGC GCGGCGGCTC GTGGAACTTC
CTTGCTGATG ATCTGCTCTG CTCTGCCCGG AACATCTTCG ATCCGCTGAA CAGGAACTAC
AGCTTTGGTT TTCGAGTTGT TCGCCCCAGT CCTCTTGCCT GA
 
Protein sequence
MPDKPHRIRI LIASPSDVQE ERKRAIEVIR QWNASQESVF LEAIDWETYA APEGDGEPQE 
KINEQIVDRC DCAVGIFWTR IGTATKVAPG GAVEEIQRLE ESGRKVMVYF SNLLMSRQAA
KDPQFQRVDK YREEREKKSL CWSYGTHDDF EKYLYHHLNI QIPRWFPELF KSKTTVKRPS
KTDFSDQQVW QKYCNKLRNE LNTISLLGSS VIQPFPLQLK DIFVPLEMYG GSHESEAMKR
CMVGPAKENV LTHRPEHVLK ETYKKYKTLL VIGDPGSGKT TLTKYYALTC LEENPPVTLG
FEGSVRVFYL PLRDLERNKK GYISLPSNLA AWSRRNNLDI KVRTFQEWLD SGISLVLLDG
LDEVSDPEQR KEICRWIKQA QGTFDKARFV VTSRPTGYRQ DDGITLDFDH QRVAVKDFSH
SQQIQFLNNW YRAALLHEIR PEDLSEGEWE AQQSGEAERL ATAMIEYFKK PENKGVRELA
AVPMLLQIMA ILWKERKFLP NRRQDLYSAA LDYLLEYRDR VRDREPLLSA EDTRRVLAPV
ALWMQEVLSF DEADRKAMHE QMQQKLDKLE GNHNARDICR NLVDRTGVLV EHGKKTYMFR
HKTFREYLAG VQLKEVWFEP DRIRTLVEHF GEESGWWDEV IKFFMAQSNE KIFDHFMREL
FASSASVDFS PKQKKLLAQV IEESPEKRVD ALCEALLNEK EISAYRQRSI LDSLKSLNQA
AALDNLHRFK KDGIALNQDI NELAEDVIRS LEKVAGITRD DKKLTKADTA TQEKTGEPER
LIRNPYEHDA QYILIPGGKY LYSETKREVT VSDLYVAKYP VTNKQYRSFI DFLAGKPSLH
DTKLALKTYK ESLHDLAGSR DDSLKGFQEY LKEKQKLAGL FRSKYDDDRK FNKDDQPVVG
VSWYAARAYC LWLTMLAGDK AEYRLPTEVE WEWAAGGRRD KPDEVLEVRS YPWGDTPEPS
SKYANYDATE GATTPVGRYP DGATPEGLYD MAGNVWEWME NWYDDNTKRS KALRGGSWNF
LADDLLCSAR NIFDPLNRNY SFGFRVVRPS PLA
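
The sequences above are displayed in space-separated blocks of 10 residues, so they need cleaning before any further processing. The following is a minimal sketch, with hypothetical helper names, that strips the block spacing and computes the GC percentage of a DNA string. Only the first line of the gene sequence is used here as sample input. Applied to the full gene sequence, the cleaned string would be expected to match the 3162 bp listed under Gene Length, and the GC percentage to fall close to the listed 48%.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group residues into blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCCGGATA AGCCGCACAG AATCAGGATT CTTATTGCTT CTCCTTCTGA TGTTCAGGAA"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(dna[:3])   # ATG
print(round(gc_percent(dna), 1))
```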