Gene Paes_1352 details


Gene Information

Locus tag: Paes_1352
Symbol:
ID: 6460347
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 1470965
End bp: 1472035
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 50%
IMG OID: 642725336
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_002016021
Protein GI: 194334161
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase
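
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1470965
end_bp = 1472035

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1071 356
```

Both results agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields.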


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0899347
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000140534
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAGCAAT TACAGGATTT ACGTGTTTCA AACATTACTC GACTGACTGC ACCGCAAACC 
TTCAAGCAAC GCCTTCCGGT GACGGAAGAG ATTGCCAGAA CGGTTTTAGA CGGTCGCGAG
GAAGTTGAAA ATATTTTATC CGGAAAAGAT TCCCGGATGC TTGTTATCGT CGGTCCGTGT
TCGATTCACG ATATCAAGGC GGCAATGGAG TATGCCGTTC GACTCAAGGC TCTGCGCGAT
GAACTGAAGG ATGATCTTTG TATCGTCATG CGTGTTTATT TCGAGAAACC AAGGACAACG
ATCGGCTGGA AAGGGTTTAT CAACGACCCG CACCTCGATG GTTCATTTGA TATCGAGCAT
GGATTGCATT ATGCCCGCAA ACTGCTTCTT GATATCAATG CCCTGGGGCT TCCTACAGCT
ACGGAGTTTC TCGATCCGTT TACACCTCAG TATGTATCTG ATCTTGTCAG CTGGGCGGCG
ATTGGAGCAA GAACCATCGA GTCTCAGACA CATCGTCAGA TGGCCAGCGG CCTGTCGATG
CCGGTAGGGT TCAAGAACTC CACCGATGGT CGTATTCAGG CTGCCATTGA TGCATTACGG
TCGGCCATGC ATGCTCACAG TTTTCTTGGT ATCGATCAGG AGGGGCACAG CAGTGTCATC
ACCACGACGG GCAATCCGTT TGGCCATATT GTGCTGCGTG GCGGTTCCCA GAAGCCGAAC
TACGATCCGG ACAATATTGC CGACGCTGAG CGGAGGCTGC AGGCAGCACA TCTGCCATCT
GCCATTATGG TTGATTGCAG TCATGCCAAT TCGGGGAAAA AGCATGAACA GCAGGCCAAC
GTCTGGGATA ATATTGTCGA ACAGCGCGTC AACGGTACGA CAAGTATCAT CGGCGTGATG
ATCGAAAGTA ATCTGTTCTG CGGAAATCAG CCTTTTCCTG ACGATCCATC TTCCCTGCAG
TACGGTGTTT CGATTACCGA TGCCTGCATT GCATGGAATG AAACTGAGAC GCTCTTAAGG
AAAGGCGCGG TGAGACTTCA TGAAGTGCTT CGAAAATCTG AGCTTTCTTA A
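
A GC content figure such as the one reported in the record can be recomputed from a sequence block like the one above. This is a minimal sketch: `gc_content` is a hypothetical helper, not a database function, and it assumes the space-separated 10-base layout shown here, so whitespace is stripped before counting.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Usage example on the first line of the gene sequence above only;
# pass the whole block to get the figure for the full gene.
first_line = "ATGCAGCAAT TACAGGATTT ACGTGTTTCA AACATTACTC GACTGACTGC ACCGCAAACC"
print(f"{gc_content(first_line):.0%}")
```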
 
Protein sequence
MQQLQDLRVS NITRLTAPQT FKQRLPVTEE IARTVLDGRE EVENILSGKD SRMLVIVGPC 
SIHDIKAAME YAVRLKALRD ELKDDLCIVM RVYFEKPRTT IGWKGFINDP HLDGSFDIEH
GLHYARKLLL DINALGLPTA TEFLDPFTPQ YVSDLVSWAA IGARTIESQT HRQMASGLSM
PVGFKNSTDG RIQAAIDALR SAMHAHSFLG IDQEGHSSVI TTTGNPFGHI VLRGGSQKPN
YDPDNIADAE RRLQAAHLPS AIMVDCSHAN SGKKHEQQAN VWDNIVEQRV NGTTSIIGVM
IESNLFCGNQ PFPDDPSSLQ YGVSITDACI AWNETETLLR KGAVRLHEVL RKSELS
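
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the tables differ in which start codons are permitted); `translate` and `CODON_TABLE` are hypothetical names introduced for illustration.

```python
from itertools import product

# Codon table in TCAG order, one amino acid per codon ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCAGCAAT TACAGGATTT ACGTGTTTCA"))  # prints: MQQLQDLRVS
```

The output matches the first ten residues of the protein sequence listed above.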