Gene Paes_1031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1031 
Symbol 
ID6460062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp1133869 
End bp1134885 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content51% 
IMG OID642725031 
Productbasic membrane lipoprotein 
Protein accessionYP_002015717 
Protein GI194333857 
COG category[R] General function prediction only 
COG ID[COG1744] Uncharacterized ABC-type transport system, periplasmic component/surface lipoprotein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.329885 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTTT CAGTTGCATC GCTGCTCCTG GTTCTTTTTT TTATCGCGGG CTGTTCATCA 
GAGTCTCCGT CCGGGGAGGA TTCCCCCTCA TCGAAGATGA CTGTCGGTAT GGTTTTCGAC
GTTGGAGGAA GAGGCGACAA GTCATTCAAT GATGCTGCGT ATCACGGCCT TGAACTTTCC
AGAGATTCTC TCGGTATCGA TTTTGTTTAC ATTGAGCCTT CCGGAGAAGG GGCTGACCGC
GAGGCTGCTC TGCGCAATCT TGCCGCTGAT GCGGATGTGA AGCTTATTTT TGGCGTTGGT
CTGCTTTTCA GTCGCGATAT AGTAGCCATC GCAAGGGAAT TCCCAGATAA ATATTTTGCC
TGCATCGATT ACGTTGGTGA AGATGGTGAG GTTTTGCCGC CAAACCTTTC CGGTATTGTT
TTCGACGATC GTAAAGGCTC TTTTCTGGCC GGTTCCATGG CCGCTATGGT CACCCGGACC
AGGACGATCG GTTTTATTGG CGGAATGGAA TCGAACGTCA TCAGGCGGTT TCATGACGGT
TATGTCGCGG GAGCGAAATC TGTCGACCCT GATATTACGA TTATTTCCGG CTACATTGGT
ATGACGGGTA GTGCATTTAC CAATCCGTCG AAGGGGAAGG AACTTGCACT TGGACAATTC
AGCAAGGGTG CTGATATTAT TTATCAGGCA GCAGGAGCAA GCGGGCTCGG CGTCGTCGAG
GCGGCTCGAC AGACCGGAAA GCTGGTCATC GGTACGGATC GTGATCAGGA GTATCTCGCT
CCGGGGCACG TCCTTTCAAG CATGATCAAA GGCATTGACC GGGCTGTTCT GAGAAGTGTC
GAACAGCTGG TCAGCGATAG CTTTCCTGGC GGAAGCGTTA CGGTGTATGG TCTTGACGGG
CGCTATACTG ACTATGTGTA TAATGCCGAA AATGCGGCTT TGGTTGGTGA TTCAGTACAT
GCAAGACTTG AATCAATCCG CAGTAAACTC ATTGACGGCT CGCTGTCGGT CGACTGA
 
Protein sequence
MKFSVASLLL VLFFIAGCSS ESPSGEDSPS SKMTVGMVFD VGGRGDKSFN DAAYHGLELS 
RDSLGIDFVY IEPSGEGADR EAALRNLAAD ADVKLIFGVG LLFSRDIVAI AREFPDKYFA
CIDYVGEDGE VLPPNLSGIV FDDRKGSFLA GSMAAMVTRT RTIGFIGGME SNVIRRFHDG
YVAGAKSVDP DITIISGYIG MTGSAFTNPS KGKELALGQF SKGADIIYQA AGASGLGVVE
AARQTGKLVI GTDRDQEYLA PGHVLSSMIK GIDRAVLRSV EQLVSDSFPG GSVTVYGLDG
RYTDYVYNAE NAALVGDSVH ARLESIRSKL IDGSLSVD