Gene Paes_0104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_0104 
Symbol 
ID6458551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp103392 
End bp105353 
Gene Length1962 bp 
Protein Length653 aa 
Translation table11 
GC content58% 
IMG OID642724091 
Productalpha amylase catalytic region 
Protein accessionYP_002014811 
Protein GI194332951 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGAGC AGAGCAATGA GCACCATACC ATCGAAAGAC GCCTCGATCA GATAGATCTC 
AACGAGCTTT GCCGCGGCAG AAGCTTTCAC CCGTCGCCCG TATCGTGGGG TGACGAAGTC
CTCTATTTCC TGTTTCTCGA CCGGTTCTCG GACGCAAAAG AGTACGGAGG ATTCACCGAC
CGGGAAGGCA GGCCTGTCGA TAGCGGTGAA AACGGACGTT CGACGCCGCT CTTCAACTTC
GAAGATGATG CTGCTCGCGC ATCGAGGAGT GAATGGTTCG AATCAGGAAA AGGATGGTGC
GGCGGCACTC TTGCGGGTCT GCGTGACAAA CTTGGCTACC TGAAACGGCT TGGCATCACA
GCACTGTGGA TCAGTCCCGT CTTCAAACAG GTCACCGGCA GCAACGACTA TCACGGCTAC
GGCATCCAGA ACTTTCTTGA CGTTGATCCG CATTTCGGAA CCCGGCAAGA GCTCAGAGAA
CTCGTGCAGG CAGCGCACGC TGAAGGCATA CGGGTCATTC TCGATATCAT CGTCAACCAC
GCTGGAGACG TCTTCGCCTA TAGCGGCAAC GAAAGGCGCA ACTACAATAA CGGAGTGGAA
TTTCCCGTGC AGGGGTTCCG TCGATTCAGC GGCGAAGAAG GAAGCATTCC ATTCAGGACC
GTGAGCCCTG AAGAAGAGCA GGAGCTCTGG CCTGACGGAG CCGTCTGGCC TGCGGAGCTG
CAAGAGCCCG GCAACTGGCG GAAAAAGGGC GAAATCGGCA ACTGGGACGG GTTTCCCGAC
TATGTCGAGG GTGATTTCCT TTCGCTCAAA GACCTGCATC TCGGCAATGG GATCAGCGAC
CCCTCGGCCG GTCAGGATAT CGGCAGAAGA ATCAGCGGAT TCTCTCCGTC CGAAACCCTG
GCGCACCTTA TCAAGGTCTA CCGCTTCTGG ATTGCCTATG CAGACATCGA CGGCTACCGT
CTCGACACCG TCAAACACAT GGAACCCGGA GCGGTCCGTC TCTTTGTCAA CGCCATTCAT
GAATTCGCCC AGTCGGTCGG CAAGGAGAAC TTCACCGTTA TCGGAGAAAT CACGGGAGGA
CGCGCTCTTG CCTTCGAAAC GCTTGAAACC ACCGGCCTTG ATGCAGCACT CGGCATCAAC
GACGTTTCCG ACAAGCTCGA ATTTCTCGCT AAAGGGTGGC GCAGCCCCGG CCACCCCGAA
ACACCTGAAC AGGAAGGCTA CTTCGATATC TTCCGCAACA GTCTCCAGGA CTGCAAGAGC
AGCCATCAGT GGTATGGCAA TCATATCGTC ACCATGTTCG ACGATCACGA CCAGGTCGGC
GTACGGCACA AATTCCGTTT CTGCGGACAG GGAGAACAGA GCTACACCCT GCTGCCGGCA
GCACTCGGGC TCAATCTTGC GACGATGGGT ATCCCCTGCC TCTACTACGG AACCGAACAG
GCCTTCAACG GCGCCGACCA CCGCGACAAC GACGACTCGT ACAGTGATGT CTTCCTGCGG
GAGTGCATGT TCGGAGGAGC GTTCGGATCG ATGCAGAGCA CCGGACGCCA CTTCTTCAAC
GAATCGCATG AGATCTACCG CTTCATCAGT CGCCTTTCCG CCCTGAGATC GCAACACCTT
GCGCTGAGAC GGGGACGTCA GTACCTCCGG CAGGTATCAG CCTCGGGTCA TGACAACGAC
TTTCACTACC CGCAGCCCCT TGGCGGCGAA CTGCGCTGGA TCATCGCATG GTCCCGCATC
TTCGCAGACC GGGAGTACCT CTGTGCCTGC AATACCGACC CGCGCCACCC CCTGACGCTC
TGGGCCACAG TGGACAGTTC GCTCCACCGC TCCGGCGAGA CCATGAGCTG CCTGTTCAGC
TCGGACCGGG AGCAGGAAGG AACATCCTCT GACATTGAAG CACGCAACGG CAAGGCGATT
GCCGTAACAG TGCCTCCGGG AGGGTTCGTG GTCTATCATT GA
 
Protein sequence
MTEQSNEHHT IERRLDQIDL NELCRGRSFH PSPVSWGDEV LYFLFLDRFS DAKEYGGFTD 
REGRPVDSGE NGRSTPLFNF EDDAARASRS EWFESGKGWC GGTLAGLRDK LGYLKRLGIT
ALWISPVFKQ VTGSNDYHGY GIQNFLDVDP HFGTRQELRE LVQAAHAEGI RVILDIIVNH
AGDVFAYSGN ERRNYNNGVE FPVQGFRRFS GEEGSIPFRT VSPEEEQELW PDGAVWPAEL
QEPGNWRKKG EIGNWDGFPD YVEGDFLSLK DLHLGNGISD PSAGQDIGRR ISGFSPSETL
AHLIKVYRFW IAYADIDGYR LDTVKHMEPG AVRLFVNAIH EFAQSVGKEN FTVIGEITGG
RALAFETLET TGLDAALGIN DVSDKLEFLA KGWRSPGHPE TPEQEGYFDI FRNSLQDCKS
SHQWYGNHIV TMFDDHDQVG VRHKFRFCGQ GEQSYTLLPA ALGLNLATMG IPCLYYGTEQ
AFNGADHRDN DDSYSDVFLR ECMFGGAFGS MQSTGRHFFN ESHEIYRFIS RLSALRSQHL
ALRRGRQYLR QVSASGHDND FHYPQPLGGE LRWIIAWSRI FADREYLCAC NTDPRHPLTL
WATVDSSLHR SGETMSCLFS SDREQEGTSS DIEARNGKAI AVTVPPGGFV VYH