Gene Paes_1339 details


Gene Information

Locus tag: Paes_1339
Symbol: pgi
ID: 6460708
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 1460546
End bp: 1462192
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 54%
IMG OID: 642725323
Product: glucose-6-phosphate isomerase
Protein accession: YP_002016008
Protein GI: 194334148
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0166] Glucose-6-phosphate isomerase
TIGRFAM ID:
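
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 1460546
end_bp = 1462192
reported_gene_length = 1647      # bp
reported_protein_length = 548    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length == reported_gene_length)        # coordinate span vs. Gene Length
print(protein_length == reported_protein_length)  # codon count vs. Protein Length
```

Under these assumptions, both comparisons agree with the reported Gene Length and Protein Length.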


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0803183
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000158602
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGAGTTTAC AGCAATCACC GGCCTGGCGC GCTTTGGCAT CGCACAAACA GACACTGGAT 
TGTGTTCTGA TGCGAGACCT GTTTGCCGAT GACCCCGAAC GGTTCAGCAG GTTTTCCATC
CGCTCAGGTG ATCTGCTTCT CGATTATTCC AAGAACAGGG TGACGTCCGA AACCATGGCC
CTTCTTGCCG ATCTGGCCCG GGAGGCCGGC GTGGAAGCCT GCCGCGACGA GATGTTTGCA
GGCAGCAAAA TCAACTTCAC TGAAAAAAGG GCCGTGCTGC ATACCGCTTT GCGACAGCCT
TGGGGCTTTC AGCTTCTGCT GGAAGGCGGT CAGGAGGTCG GAAAGGATAT TGCAGCAGTG
CTCGGGCAGA TGAAAGGGTT TACCGAGCAG ATTCTCAGCG GAAAGTGGAA AGGATACAGC
GGCAAGGCTA TTACCGATGT GGTCAATATC GGGATTGGTG GTTCGGACCT GGGGCCGTAT
ATGGTTACCG AAGCGCTCAG ACCGTTTGCG CATGGCGCCG TCAAGGTGCA TTTCGTGTCA
AACATCGACG GGACGCACAT CAGCGAGACC CTGAAGCGGG TCGATGAGGA GACGACTTTG
TTTATTATAG CATCAAAGAC CTTCACGACG CAGGAAACAC TGACCAACGC CCATACCGCC
AGATCGTGGT TTCTTGAGCG AGCCGTTGAC GAATCGTTTA TTGCGCGTCA TTTTGCAGCC
GTTTCAACCA ATAAGGAGGC CGTTCAGGCA TTTGGTATCG ATACCGGCAA CATGTTCGGT
TTCTGGGACT GGGTCGGCGG ACGTTACTCG CTCTGGTCGT CTATCGGCCT TTCAATTGCC
TTGTATCTGG GCTTTGATCG CTTTCTCGAA CTGCTGGCAG GAGCGCACTC GATGGATGAA
CATTTTCGTT CCGCTCCGCT TGACGCAAAC ATTCCGGTCG TGCTTGCGCT GCTGGGCATC
TGGTACAACA ACTTTTTCGA TTTCCCTTCT CATGCAGTCA TTCCCTATGA TCAGTATCTG
CATCGGCTGC CGGCCTACCT TCAGCAGCTC GATATGGAGA GTAACGGCAA GCGGGTCGAT
CGCGACGGCA ATGTGGTCGA TTATGCGACC GGTCCGGTCA TCTGGGGCGA GCCGGGCACG
AATTCACAGC ATGCGTTTTT TCAGCTGATG CATCAGGGGA CGTCATGTGT GCCGGCTGAT
TTCATTTTGC CTCTGAAGAC GCAGAACCCG GCAGGGGAGC ATCACGATAT TCTCGCCGCG
AACTGTTTTG CCCAGACTGA AGCGCTCATG AAAGGCAAAA CCGCAGCAGA GGCGCGAGCG
GAACTCGGTT CGGCAGGAAT GAGCGAAGAG GAGATCGACG CACTCGTGCC GCACAAGGTT
TTTCCCGGCA ACCGCCCGAC CAATACCCTG ATTTTCAATG AGATCAATCC GTTCAATCTC
GGCGCTCTTA TTGCGATGTA TGAACACAAA GTGTTCGTGC AGGGCGTGAT CTGGAGAATC
AACTCATTTG ACCAGTGGGG GGTCGAGCTT GGCAAGCAGC TGGCAAAAGC CATACTGCCT
GAACTTGGTT CCGCTGATGA CGTCGCAACC CACGACGCTT CAACTAACGC GCTCATCAAT
CTCTACCGCC GTAGCAGGAA TGGCTGA
 
Protein sequence
MSLQQSPAWR ALASHKQTLD CVLMRDLFAD DPERFSRFSI RSGDLLLDYS KNRVTSETMA 
LLADLAREAG VEACRDEMFA GSKINFTEKR AVLHTALRQP WGFQLLLEGG QEVGKDIAAV
LGQMKGFTEQ ILSGKWKGYS GKAITDVVNI GIGGSDLGPY MVTEALRPFA HGAVKVHFVS
NIDGTHISET LKRVDEETTL FIIASKTFTT QETLTNAHTA RSWFLERAVD ESFIARHFAA
VSTNKEAVQA FGIDTGNMFG FWDWVGGRYS LWSSIGLSIA LYLGFDRFLE LLAGAHSMDE
HFRSAPLDAN IPVVLALLGI WYNNFFDFPS HAVIPYDQYL HRLPAYLQQL DMESNGKRVD
RDGNVVDYAT GPVIWGEPGT NSQHAFFQLM HQGTSCVPAD FILPLKTQNP AGEHHDILAA
NCFAQTEALM KGKTAAEARA ELGSAGMSEE EIDALVPHKV FPGNRPTNTL IFNEINPFNL
GALIAMYEHK VFVQGVIWRI NSFDQWGVEL GKQLAKAILP ELGSADDVAT HDASTNALIN
LYRRSRNG
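
The sequences above are printed in blocks of 10 characters, 60 per line, so they need to be joined before any length or GC calculation. The following is a minimal sketch of that cleanup, not part of the original record; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(block_text: str) -> str:
    """Remove all whitespace (spaces and line breaks) and uppercase the result."""
    return "".join(block_text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not sequence:
        return 0.0
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First line of the gene sequence block, copied as printed above.
first_line = "ATGAGTTTAC AGCAATCACC GGCCTGGCGC GCTTTGGCAT CGCACAAACA GACACTGGAT"

seq = clean_sequence(first_line)
print(len(seq))           # number of bases on one full line
print(gc_fraction(seq))   # GC fraction of this line only, not of the whole gene
```

To compare against the reported Gene Length and GC content, the full gene sequence block would have to be passed to `clean_sequence` in place of `first_line`.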