Gene Paes_2002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_2002 
Symbol 
ID6459847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp2199257 
End bp2200462 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content52% 
IMG OID642725985 
Productaminotransferase class V 
Protein accessionYP_002016659 
Protein GI194334799 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.557544 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACAC GACGGGTCTA TCTGGACCAT AATGCCACAA CCCCGCTACA TCCCGAGGTT 
AAAAAAGAGA TGATCGAAGC GATGGAAATG TTCGGTAATC CATCGAGCAT GCATGCTTTC
GGGCGTGAAG CACATGCCAA TATCGAACAT GCGCGTCAAA GCGTTGCTGG ATTTATCGGT
GCGCATGAAG ACGAACTTGT TTTTGTCGGC AGTGGTTCCG AGGCTAACAA TACCGTTCTC
TCACTGTTTG CCTGTGCGTC AAATCTTTGT ATTCCGGGAT TTAAAGCCCG TCAGACAATC
ATTACAAGCC GGATCGAGCA TCCATGCGTG CTTGAAACCT CTCAGTGTCT TGCTCACCGC
GGTACCACGG TGAAGCTGCT CGATGTGGAC CGCTACGGGC GCATCGATAT CGACCAGCTC
AAAGAGTATC TTTCCGATGA TGTGGGGCTT GTTTCGGTGA TGACGGCAAA CAACGAGATC
GGCACGCTCC AGGATATTTC GGAAATCGGT CGTCTGGCCC ATGAAAACGG CTCTCTTATG
CACACCGACG CTGTACAGGC ATTTGGAAAG ATGCCGCTCG ATGTCGATGA GCTCGGTGTC
GATTTTCTGA CGATGAGTGC GCATAAAATT TACGGCCCCA AGGGTATCGG GGCGCTCTAT
GTTCGTAAAG GCACGCCCTA CTGCCCCTTT ATCAGGGGAG GTCATCAGGA GAAGGGGCGA
AGGGCCGGCA CTGAAAATAC GTTAGGGATT ATGGGGCTTG CCAAAGCTGT TGAGATGAGG
GCGCTCGAGA TGGAGGACGA AGCCAGGCGT TTTGCCGCTA TGAAGGAGGT GCTTGTCAAA
GGGATCGAGG AGCGAATCGA TAACGCGTTA TTCAACGGTC ATCCTGAGCT CAGCATGCCC
AATACGGTCA ATGTCTCGTT TCCCGGTGCG GAAGGCGAAG CAATTCTGCT CTATCTCGAT
CTTGCCGGAA TAGCGGTTTC TACCGGTTCA GCCTGTGCGT CGGGATCGCT CGATCCTTCT
CATGTTCTGC TTGCTACCGG TTCGAGTGCA GAACGGGCCC ATGGATCGAT CAGGATCAGT
ATGGGAAGAG AGACGACCAT GGAGGAGATC GAGTATGTGC TCGATGTTCT GCCAGGGGTC
ATATCACGAA TAAGAAGTAT GTCAACAGCT TATATAAAAG GAGAAGAACA TGTTGCAGCA
AAATGA
 
Protein sequence
METRRVYLDH NATTPLHPEV KKEMIEAMEM FGNPSSMHAF GREAHANIEH ARQSVAGFIG 
AHEDELVFVG SGSEANNTVL SLFACASNLC IPGFKARQTI ITSRIEHPCV LETSQCLAHR
GTTVKLLDVD RYGRIDIDQL KEYLSDDVGL VSVMTANNEI GTLQDISEIG RLAHENGSLM
HTDAVQAFGK MPLDVDELGV DFLTMSAHKI YGPKGIGALY VRKGTPYCPF IRGGHQEKGR
RAGTENTLGI MGLAKAVEMR ALEMEDEARR FAAMKEVLVK GIEERIDNAL FNGHPELSMP
NTVNVSFPGA EGEAILLYLD LAGIAVSTGS ACASGSLDPS HVLLATGSSA ERAHGSIRIS
MGRETTMEEI EYVLDVLPGV ISRIRSMSTA YIKGEEHVAA K