Gene Anae109_1643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_1643 
Symbol 
ID5376242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp1847666 
End bp1848940 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content72% 
IMG OID640843152 
Productsulfate adenylyltransferase, large subunit 
Protein accessionYP_001378831 
Protein GI153004506 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2895] GTPases - Sulfate adenylate transferase subunit 1 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00485] translation elongation factor TU
[TIGR02034] sulfate adenylyltransferase, large subunit 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGCCG CCGAACGACT CCCCGCGCCC GACGCGGACA GCCTGCTCCG CTTCCTGACC 
TGCGGCAGCG TCGACGACGG GAAGAGCACG CTCATCGGGC GGCTCCTCCT CGACACCAAG
TCCATCCTCG CCGACGCGCT GCACGCGATC GAGCGCACCT CGCGCAAGCG CGGCCTCGAG
GCGGTCGATC TCTCGCTCCT CACCGACGGG CTGCAGGCGG AGCGCGAGCA GGGCATCACC
ATCGACGTCG CGTACCGCTA CTTCTCGACC GGGACGCGCA AGTACATCAT CGCCGACGCG
CCGGGCCACG AGCAGTACAC GCGCAACATG GTCACCGCGG CCTCCACCGC GAGCCTCGCC
GTGATCCTCG TCGACGCGCG CAAGGGCGTG CTCACCCAGA CGCGGCGCCA CTCGTACCTC
GCCCACCTGG TCGGCATCCC GCACCTCGTG GTGGCGGTGA ACAAGATGGA CCTCGTCGGC
TGGTCCTCCG AGGTCTTCGA GCGGATCCAG GCAGACTACC TGGCGTTCGC GGAGAGGCTC
GGGATCGAGG ACGTGCGCTT CATCCCCATG TCCGCGCTCG AGGGCGACAT GGTCGTGGAG
CGCGGCCAGA ACCTCGGGTG GTACCGCGGG CCGACGCTGC TCGAGCTGCT CGAGGAGGCC
CCTCCCGGCC ACGCCGAGGC TCCGGAGCCG TTCCGCTTCC CCGTCCAGTG GGTCTGCCGG
CCGCAGACCC TCGAGCACCA CGACTTCCGC GGCTACATGG GCCGGGTCGA GTCGGGCGAG
ATCCGGGTGG GCGACGCGGT GCAGGTGCTG CCCTCCGGCC GCTCGACGCG CGTGAAGGAG
ATCCGGCTCC TCGACGCGTC GCTCGCGAAC GCGGTGAGCG ATCAGTCCGT GACGCTGCTC
CTCGAGGACG AGCTCGACGT CTCGCGCGGC GACCTCCTGG TCCGCGCCGG CGAGGCGCCC
GAGCCGACCC GCAAGGTCGA GGCCATGCTG TGCTGGCTCT CGGAGCGGCC GCTCGCGACC
GGGCGCCGTT ACCTCGTCCG CCACACGACG CGCGAGGCGC GCGCCACGAT CTCGGAGATC
GCCTTCCGGG TGGACCTCGC GGAGCTCGGC GAGCGGCCCG CCGACACGCT CGCCATGAAC
GACATCGCGC GCGTGTCCCT CCGGCTCGCG CAGCCGATCG CCGCCGACCG GTACGCGGTC
AGCCGCGCCA CCGGCGCGCT CATCGTGATC GACGAGGCCA CGAACGACAC GGTCGCGGCG
GGGATGATCC TGTGA
 
Protein sequence
MLAAERLPAP DADSLLRFLT CGSVDDGKST LIGRLLLDTK SILADALHAI ERTSRKRGLE 
AVDLSLLTDG LQAEREQGIT IDVAYRYFST GTRKYIIADA PGHEQYTRNM VTAASTASLA
VILVDARKGV LTQTRRHSYL AHLVGIPHLV VAVNKMDLVG WSSEVFERIQ ADYLAFAERL
GIEDVRFIPM SALEGDMVVE RGQNLGWYRG PTLLELLEEA PPGHAEAPEP FRFPVQWVCR
PQTLEHHDFR GYMGRVESGE IRVGDAVQVL PSGRSTRVKE IRLLDASLAN AVSDQSVTLL
LEDELDVSRG DLLVRAGEAP EPTRKVEAML CWLSERPLAT GRRYLVRHTT REARATISEI
AFRVDLAELG ERPADTLAMN DIARVSLRLA QPIAADRYAV SRATGALIVI DEATNDTVAA
GMIL