Gene Paes_1854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1854 
Symbol 
ID6460180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp2026006 
End bp2027235 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content46% 
IMG OID642725838 
Producthypothetical protein 
Protein accessionYP_002016513 
Protein GI194334653 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAACA AGCAGAGCCA TGGACTTGAT ATCTACCAGG CAAAGGCCTG GCGTAACGAA 
CACACCAATG CAGGGTTCAC CAAAGTCATA AGGATTGCTG AAAAAGAGAT CGTTCTGGTC
AGTGTCGAAA CGCAACAAAA CGTCACCATC GATTTCATTG ATGCAGATCT GCTCAATACT
GCCCTTCTCG AATCCGGATG CATCGGTAAA AAGATGATTC TCCTCTGGGA TCTGGAGCAT
GTCGTCAACA TGACCTACCG ATACAAAAAA GATGCAGCAA ACCTTATCTA TAACAGCGAC
AGCTCCTTCA AAGCGATTAT CTTTTTCAAT GTCCGGCCTG AATTCATGAC GACAGTTGAA
ACTTTTGCGG CAATTGTCCA GAAATCGACG TCCATCCGGA TTGTCGATAC ATTTCAGGAA
GCCATGTCTG CAATAGATGA AATTCGATCT GGCAACATCG ACAATGAGGC TGATAACGAT
GAAACGGATG AACTGTTTGA ACAGCGAAAA AAAGAGTTTC TCGCCGTCAC AGGCCGGCTG
AGCTGGCTGA ACATGCTTAA CCAGAATATT AACGTACCCT CACCCGAGGA TCCTGTCTAC
CCATATTTCA AAGCCATAGA AAACCTGCAA TCAGACCTTT CCGAAAACCT TCATCGCGAA
CAACTCGAAA TGGAGCAGAT AAAGAACGAT TGCGAGAGAA TACTGACTGA AAAGACCATC
CAGCTCAACG CGCAGCAGGA ACTCTACAAA CAGTTGAAAC GCCAGCTTGA AAAAGAGAAA
AACACCCTGG CAGCAAGGAT TGCATCGCAG GAAATGGAAC TCACGCGAGT CTCTACAGCG
ATTGCAGAAA AAGCCTCGAC GCTCCAGGAA ATGCGCGACC TCATCAGCGG GCTCGATATC
GATGCCGAGC ACAAGGAAGA GATGATCAGA ACCTGTGAGA GCATGATTGA AACAGAGATG
ATCGAAAAAA AGCTCAACAT AGAGCTGACC ACAACAGATT CCGAATTTCT GCTGAAACTG
CAGAAAAAAC ACCCCAATCT CAACCAGAGA GAACTACGCA TCTGCCTGCT GGTCAAACTC
AATTACGATA CCAAAGAAAT TGCACGTTCG ATTGGCATTT CGACAAGAGG TATGGAAAGT
ATCCGATACA GAATGCATAA AAAAATAGGA CTTACCCGAC ATCAGTCCAT CAAAGGCTAT
CTCACGGAAC TTGCAGTAGC CCAAGCCTGA
 
Protein sequence
MKNKQSHGLD IYQAKAWRNE HTNAGFTKVI RIAEKEIVLV SVETQQNVTI DFIDADLLNT 
ALLESGCIGK KMILLWDLEH VVNMTYRYKK DAANLIYNSD SSFKAIIFFN VRPEFMTTVE
TFAAIVQKST SIRIVDTFQE AMSAIDEIRS GNIDNEADND ETDELFEQRK KEFLAVTGRL
SWLNMLNQNI NVPSPEDPVY PYFKAIENLQ SDLSENLHRE QLEMEQIKND CERILTEKTI
QLNAQQELYK QLKRQLEKEK NTLAARIASQ EMELTRVSTA IAEKASTLQE MRDLISGLDI
DAEHKEEMIR TCESMIETEM IEKKLNIELT TTDSEFLLKL QKKHPNLNQR ELRICLLVKL
NYDTKEIARS IGISTRGMES IRYRMHKKIG LTRHQSIKGY LTELAVAQA