Gene Paes_2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_2021 
Symbol 
ID6459806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp2219245 
End bp2220387 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content55% 
IMG OID642726004 
Producthypothetical protein 
Protein accessionYP_002016678 
Protein GI194334818 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0664622 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGGCTC GACCCTCACC AAAAGCGTTC TTCATAATCC TGCTGCTGCT GACAGTCGGG 
TGCGCCACTG ACCGCCCTCC TTCCGGAGGC CCTCCGGATA ACGCTCCCCT GCGGATAACA
GACGTCCAGC CTGAAGCGTC TTTAACCGAT ATTCAGCCAG AAACGATCCG TTTCACCTTC
AACCGTTATG TCCCCACAGC ATCCCTGAGG CGCTCGATAG TTTTTTCACC AAGGATCACC
GGCTATGAAA TCAGGGGTGA TGGCGAAGAA GCTGTCATCA TCTTCAACGA GCCTTTTGAG
CAGAACAGAA CCTATTCGAT CTCATTCAAC ACATCCCTGC AGAGCAGCCG CGGCAACGAA
CTTGAAAAAA GTTACACCTA CGCTTTTTCA ACCGGTCCCT TTCTCGACAG CGGCGAAATC
GAAGGAACCG TCTATACCAG GGAGAACAAA CCGGCCAGAG GCGCATTGAT CTATGCGTTT
CTTCGAGAGC CTCAACAAGC GCAGGCAGAA CAATCAATCC TTGAACGGCA CCCGGACTAC
GTCGTCCAGA CAGGTACGGA CGGAACCTTC CGCTTCAATC ATCTGAAAAA GGGCAGCTAC
CGGCTCATGG CCTTCATGGA TAAGGACGGT AACCGGATAC TGAACCTGAA CCATGAAGCG
CATGCCTCCG GCACGATCGA AAACGTTCCG ACAGGCTCCC GGCCGCTGCT GTTCAGAATG
TCGTCGCCAC ACGAGGAGAA GCGCCAATCA GCAGCAGCTG GCAAAAACGC ACCCCAGCCG
ACGGACCCCG GCGCCATTAT CGGAACGATT CGGACCCTGC ACCATGCGGC TGTCATCGAA
GCAGTCAATA TGACGACAGG CACATGGTAC CGGACAACCG CCGTCAACAC CCGGCATAGA
GAACAATCGT TTGTGCTGAA AAATCTTCCC GCAGGGCGCT ATCTCGTCAG TGCCTATCTG
CCCGGAAAGG ATATTGCTGC TGACGGAAGC ATCCCTCAAT GGAGCCCCGG CAATATTTGG
CCATTCAGGC CCGCAGACGA GCTTGTCATC CACCCCGATC CGGTTATCGT CCGCAAGGGC
TGGACGACAG GCAACATTGA GCTGAACCTG CAGCCATCAG CGCTGAGAGG AAAAGAGAAA
TGA
 
Protein sequence
MKARPSPKAF FIILLLLTVG CATDRPPSGG PPDNAPLRIT DVQPEASLTD IQPETIRFTF 
NRYVPTASLR RSIVFSPRIT GYEIRGDGEE AVIIFNEPFE QNRTYSISFN TSLQSSRGNE
LEKSYTYAFS TGPFLDSGEI EGTVYTRENK PARGALIYAF LREPQQAQAE QSILERHPDY
VVQTGTDGTF RFNHLKKGSY RLMAFMDKDG NRILNLNHEA HASGTIENVP TGSRPLLFRM
SSPHEEKRQS AAAGKNAPQP TDPGAIIGTI RTLHHAAVIE AVNMTTGTWY RTTAVNTRHR
EQSFVLKNLP AGRYLVSAYL PGKDIAADGS IPQWSPGNIW PFRPADELVI HPDPVIVRKG
WTTGNIELNL QPSALRGKEK