Gene Paes_1338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1338 
Symbol 
ID6460630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp1459087 
End bp1460106 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content55% 
IMG OID642725322 
ProductApbE family lipoprotein 
Protein accessionYP_002016007 
Protein GI194334147 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.337899 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.41849e-06 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGCGACCAC GCTATTTCTC ATTCCCGGCG CTCTTCATCC TTTTCCTGAC GCTCCTTGCC 
TGCTCGGGCA GGGGCGACGA CCTGCGGATT TACGAGCAGG AGAAGGTGAT GATGGGGACG
ATCATGAAGA TCAAGGCGGT TGCTCCCGGC AATGCGGAGG ACAGTACGCG AGCGGCGTTT
GAGGCGGCGT TCGGGGAGAT GTCGGAGCTT GAATCCGAGC TGAGCGAGTG GCAAAGTACG
AGCGCGGTGT CGGCGGTGAA CCGGGAGGCG GGCGTGAAGA GCGTGAAGGT GCCGGACGCG
GTTGTGACCG TGACAGAGAA GGCGCTGGAG ATCGCGACAA TGACAGATGG CGCGTTTGAC
GTAACTTTCA AACCGGTGGG TCAGCTGTGG AACGTCAAGG AAAGGACTGC CCCGCCGCCT
CAGGACAGTA TTGCAACAGC GTTGTCGCTT GTCGATTACC GGCAGATCAG ACTCGACAGA
GCAAAGCGCA CGCTCTACCT GACGAAAAAG GGAATGGAAA TCGGTTTCGG GGGAATTGCG
AAGGGATATG CTGCGTGGCG GGCTGGTGAA GTGCTGAAAA AGCACGACAT CCGTGATTTT
ATTATCAATG CCGGAGGCGA TCTCTATGTC GAGGGGAAAA AAGGTGAACG GTTCTGGACG
TCTGGCATAA AAAATCCCGA TCAGGACAAC GCGAAACCTG TCACCACGTT CAATGTGATT
GCGACATGCG GTGTGGCAAC GAGCGGGGAT TATGAGAATT TCTTCACCTG GAAAAGCGAA
CGCTACCACC ATATCATCGA TCTGAAAACG GGCTATCCGG CGAAAGGAAT GAAAAGCTCG
ACGGTGTTTT CAAGCGATCC GGCAAAAGCC GATGCCTATG CGACAGCGTT CTTCATCATG
GGATATGAAA AAGCCCTGGC GGTTGTCGCG GAGGACCCGT CCGTGGCGTT CATTCTGATC
GACAGCGACA ACAAGGTCAT GAGAAGCCCG AATCTCGACC AGTTCATTCA GGAACACTGA
 
Protein sequence
MRPRYFSFPA LFILFLTLLA CSGRGDDLRI YEQEKVMMGT IMKIKAVAPG NAEDSTRAAF 
EAAFGEMSEL ESELSEWQST SAVSAVNREA GVKSVKVPDA VVTVTEKALE IATMTDGAFD
VTFKPVGQLW NVKERTAPPP QDSIATALSL VDYRQIRLDR AKRTLYLTKK GMEIGFGGIA
KGYAAWRAGE VLKKHDIRDF IINAGGDLYV EGKKGERFWT SGIKNPDQDN AKPVTTFNVI
ATCGVATSGD YENFFTWKSE RYHHIIDLKT GYPAKGMKSS TVFSSDPAKA DAYATAFFIM
GYEKALAVVA EDPSVAFILI DSDNKVMRSP NLDQFIQEH