Gene Paes_0918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_0918 
Symbol 
ID6460798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp1008333 
End bp1009532 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content48% 
IMG OID642724921 
Productinternalin-related protein 
Protein accessionYP_002015608 
Protein GI194333748 
COG category[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.482168 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGAACAAA ATACTTATAA CAGATGCCCG GTCTGTCAAT TTCCTCTGAC CGAAAAGAGT 
GCCGTATGTC CCCGTTGCGG TAACGATATC CTTGCCGATA TCACCTCGTT AGACCAGCAA
TCACGGGAAA AACATCTGAA AATTATCGAA GAAAAGCAGG CTGAATGGTA TGCTCGATGT
GTTACCGACA ATCTCAACTG CTGTGAAATG CAACCATCAA CGCTCAGCCA AGCGGATTGC
GCAAGCCACA GAACCAGAGA AGGAAGCGGA AAACTTGTCG ACGCAGAAAT CTTTTCTCTG
GCAAAACAAC AGCTTCTGGA GGATCACAAC AGGATCGTTC AATGGTGGGA ATCGCTGAAC
AGCGACTGGA AAGAGGTGAT CAAAAACACG TTAAAAACGT CAGACGATCC ATCAAACCAG
GAGCTCATCA AGTTTTTTGA TATAACGAAC CTGCGATGCG ATAACCGAAG GATTCACAGT
CTTGCACCGG TTAGGGTCCT TGAAAACCTG CAACAGCTTC GCTGTGATGA ATCTCCTGTC
GAAAACCTCG AACCGCTTTC TGCAATGCGC AAGCTCCAGC GGCTCTATGC GTTCGACTGC
GACTTCATTT CTCTTGAACC ACTCCGCAAC GTCACGAGTC TGAAACTGCT CTGGATCTCC
AGCACAGAGG TCAGTGATCT TTCACCGATT GAAGGACTTG TCAATCTCGA AGAACTCTAC
TGTTCGGAAA CGCCGGTGTC AGACCTTTCG CCAGTCGCAG AGCTTACCAA GCTTGAAAAG
ATAAGCTGTT ACAAAACCGC AATAGCCTCT CTCAAACCGC TTGCCAGGCT TGAAAATCTT
ATAGAACTGG GCTTCAATCA TACACTGGTA ACCGATCTCG ACCCGCTGAC AAACCTGGAA
AATCTGGAAT ATCTCCGTTT CAGCAATACT GCGATCAGCA GCCTTGATCC GCTGGCGCAT
CACATCAACC TGCGCGAACT GAGCTTCAAC GACACCGGAA TCACCACTCT CGAACCGCTG
GCGTCGCTCC CCGAACTCGA AGAGGTCAGC TTCGCTGCTA CAGCCGTTTC ATCGATAAAA
CCGCTGATGG AGCTTGAATA CATCGAGAAA ATAGAACTCT CGAAAAATCA GATTCCAACT
GACGAACTGG AACAATTCAT GGACGCCCAC CCTGACTGTG AAATAGTGAT AAGGAAATAA
 
Protein sequence
MEQNTYNRCP VCQFPLTEKS AVCPRCGNDI LADITSLDQQ SREKHLKIIE EKQAEWYARC 
VTDNLNCCEM QPSTLSQADC ASHRTREGSG KLVDAEIFSL AKQQLLEDHN RIVQWWESLN
SDWKEVIKNT LKTSDDPSNQ ELIKFFDITN LRCDNRRIHS LAPVRVLENL QQLRCDESPV
ENLEPLSAMR KLQRLYAFDC DFISLEPLRN VTSLKLLWIS STEVSDLSPI EGLVNLEELY
CSETPVSDLS PVAELTKLEK ISCYKTAIAS LKPLARLENL IELGFNHTLV TDLDPLTNLE
NLEYLRFSNT AISSLDPLAH HINLRELSFN DTGITTLEPL ASLPELEEVS FAATAVSSIK
PLMELEYIEK IELSKNQIPT DELEQFMDAH PDCEIVIRK