Gene Paes_1801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1801 
Symbol 
ID6458941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp1966602 
End bp1967888 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content55% 
IMG OID642725785 
ProductRibulose-bisphosphate carboxylase 
Protein accessionYP_002016460 
Protein GI194334600 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1850] Ribulose 1,5-bisphosphate carboxylase, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.48391 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCAG ACGATATAAA AGGTTTTTTT GCTGAAAGAG ATCAGCTTGC CATGGCTGAC 
TATCTTGAAC TCGATTATTA CCTGGAATGC ATAGGTGATA TCGAGGTTGC TCTTGCCCAT
TTCTGCAGTG AGCAGTCAAC GGCTCAGTGG AGCCGTATCG GTATCGATGA GGACTTCAGG
GACCTGCACG CAGCGAAGGT TCTTTCATGG GAGCTTATCG GAGAACTTCA GCAGCTCAGC
TATCCGGTCG ACCAGGCTGC TGAAGGAGCG ATCCACGCCT GTCGCGTGAC CATCGCTCAT
CCCCACCGCA ATTTCGGGCC AAAGCTCCCC AACCTCATGA CCGCGGTTTG TGGTGAAGGG
ACCTACTTTA CTCCTGGAGT GCCGCTGGTC AAACTGCTCG ATATCCGTTT TCCCGCCTCC
TATCTGGCCG CCTTCGAGGG GCCGAAATTC GGGATCGACG GTATCAGGGA GATGCTTGGC
GCATACAATC GTCCGATCTT TTTCGGTGTT GTCAAACCAA ATATCGGCCT CAAGCCGGAA
CACTTCGCCG ATATAGCCTA CCAGAGCTGG CTGGGTGGGC TCGACATCGC CAAAGACGAT
GAGATGCTTG CTGATGTTCC CTGGTCATCC ATGGCCCGTC GTTCCGAGCT GCTCGGCAAG
GCACGCCTTG AAGCCGAGGC GCTGACCGGT CAGAAAAAGG TCTATCTGGC CAACATCACC
GACGAAGTCG ACCGCATGAT CGAACAGCAT GATATCGCGG TCAAAAACGG TGCCAACGCC
TTGTTGATCA ACGCTCTTCC CGTAGGGCTC AGTGCCGTGC GTATGCTGAG CCGTCATGCC
AAAGTACCGC TCATCGGTCA TTTTCCCTTT ATAGCCGCTT TTACCCGTCT CGAAAAATTC
GGTGTTCACA CCCGCGTGAT CACCAAGCTG CAGCGGCTTG CCGGCCTTGA CGCCATCATC
ATGCCCGGTT TCGGCAGCAG GATGATGACG TCGGAAGAAG AGGTCCTCAG CAATGTGAAC
GAATGTCTCT GTGATATGGG TCACATCAAA CGTTCGCTGC CCGTTCCCGG AGGCAGTGAT
TCGGCCCTGA CGCTCGAAAA CGTCTACCGC AAAGTTGGCA GTGTTGATTT CGGGTTTGTA
CCCGGTCGCG GTATTTTCGG TCACCCTCAA GGCCCCAAAG CCGGCGCCGC AAGCATTCGT
CAGGCCTGGG AGGCCATTGA ACAGCATATC CCGATAGACA CCTACGCCCA AACTCATCCT
GAGCTCAAGG CGATGGTGGA AAAGTAA
 
Protein sequence
MNADDIKGFF AERDQLAMAD YLELDYYLEC IGDIEVALAH FCSEQSTAQW SRIGIDEDFR 
DLHAAKVLSW ELIGELQQLS YPVDQAAEGA IHACRVTIAH PHRNFGPKLP NLMTAVCGEG
TYFTPGVPLV KLLDIRFPAS YLAAFEGPKF GIDGIREMLG AYNRPIFFGV VKPNIGLKPE
HFADIAYQSW LGGLDIAKDD EMLADVPWSS MARRSELLGK ARLEAEALTG QKKVYLANIT
DEVDRMIEQH DIAVKNGANA LLINALPVGL SAVRMLSRHA KVPLIGHFPF IAAFTRLEKF
GVHTRVITKL QRLAGLDAII MPGFGSRMMT SEEEVLSNVN ECLCDMGHIK RSLPVPGGSD
SALTLENVYR KVGSVDFGFV PGRGIFGHPQ GPKAGAASIR QAWEAIEQHI PIDTYAQTHP
ELKAMVEK