Gene Acid345_0829 details

Gene Information

Locus tag: Acid345_0829
Symbol:
ID: 4072355
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1028854
End bp: 1030485
Gene Length: 1632 bp
Protein Length: 543 aa
Translation table: 11
GC content: 57%
IMG OID: 637982838
Product: sulfate adenylyltransferase subunit 1
Protein accession: YP_589908
Protein GI: 94967860
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2895] GTPases - Sulfate adenylate transferase subunit 1
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR00485] translation elongation factor TU
            [TIGR02034] sulfate adenylyltransferase, large subunit
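The coordinates and lengths listed above are mutually consistent, which a short check can illustrate. This is a minimal sketch that assumes 1-based inclusive coordinates and a gene length that includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the gene length
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1028854, 1030485)
print(length, protein_length_aa(length))
```

With the values in this record, the computed lengths agree with the listed 1632 bp and 543 aa.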


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTACAG TGTTCACTCT TCGCGAAGAC AATGCCGATT CTGATATCCA GAGTTTTCTG 
GAGCAGGAAC AAGCCAAGGA CATTCTGCGC ATCAGCACTG CAGGCAGCGT GGACGACGGC
AAGTCCACAC TCATCGGTCG CCTGCTCTAT GACTCGCGCA ACGTTTATGA GGACCATGTT
CGTTCGGTGA CGCGCCACGA TGTTTCGCTT GGCACATCGG TGGTGGATTT TGCACAACTG
ACAGATGGGC TTCGTGCCGA ACGCGAGCAG GGAATCACTA TTGACGTAGC GTATCGCTAC
TTTTCAACTG CGAAACGGAA GTTCATCATC GCCGACACTC CGGGCCACGA GCAGTACACG
CGCAACATGG CTACGGGTGC TTCGACTTCC GATCTCGCGA TCGTGCTCAT CGATGCGCGC
AAAGGGGTGC TCGTTCAATC GCGCCGTCAC CTCTACATTG CCGCCCTGCT GGGGATCCCG
CGGGTGGTGG CGACAATTAA CAAGATGGAC CTGGTGGATT TTTCGCCCGA GGTGTTCGCC
GCCCACTCGC TGGAATTGAA GCGCCTCGGC GATGGACTCG GGATACCGAG TCTCGTCACG
ATTCCCATCA GTGCATTGGA TGGAGACAAC GTCGTTGAGA CCAGCGCCCG CACTCCTTGG
TACGACGGCC CAAGTTTGTT GCAGTTTCTG GAAACCGTTC CGGTAGAAAA CGCTTCGGAG
GTCGCGTTTC GTCTTCCGGT GCAGCGCGTG CTTCGCCCAC ACCAGGAGTA CCGTGGCTTT
GCGGGACAGA TCGCCGCGGG CGCAGTGCGG CCCGGGGATC AAGTGGTTGT CTTGCCGTCA
GGCCGAAGCA GCCGAGTGCG TTCCATCACC ACGTTCGATG GAGACTTGCC ATCCGCTGAA
GCCCCACTTT CCATCGCCCT GACACTAGAG GATGAAGTGG ATGTCAGCCG TGGGGACGTG
ATCGCAGCGG CAGATGCGCC TCCCACGGTG GCAAAGCGGT TCGAGGCCTC GCTCGTCTGG
ATGGATTCAG CTGAGTTCCG TCCCGCCAAG CGATATCTGC TGAAACACAC CGCCCAGGTG
GTGCCCACCA CCGCAATCAC AATCAAAAAC CGAGTGAATG TGCAGACGTT TGAAACGGAA
CAATCGTTCG TGTTGCACAT GAACGACATC GCGATCGTGG AGATCGAAAC CAAGCGCCCC
CTGGTAGGCG ATTCATACCG CGACAATCGC ACTACGGGGA GCTTCATCTT GATCGATCCG
GAGACGAACT CCACCGTCGC TGCCGGAATG ATCCGTTCGT TTGTCGTTAA CCCATCACTA
AGCAAACATC ACGTTCCTAT CGTTGAACTG CGCAAAGGAG CTGTCGTGGA CGCGGTCGAG
CAGAAATTGT TCGATCGTGG ATACCTGGTG GTTCGCACGA AGCTCGATCC TGACCGGTTG
CTGCAGCGCT TGCATCTGCC TCGACTGGTC CTGCTCCTGG AAGGCCCTGA CGTTCACTAT
TCCCGGATCC GCTCTTTCTC ACGTCAAACG CGTTTGTCCG TGGAGTACAG AGTCGAGACA
TCCGCTACAC CCGACAGCAT TGTTGAAAAG ATCGTTGCGC TGCTGACGGA GGAGAAAATC
GCTCATGAGT AG
 
Protein sequence
MATVFTLRED NADSDIQSFL EQEQAKDILR ISTAGSVDDG KSTLIGRLLY DSRNVYEDHV 
RSVTRHDVSL GTSVVDFAQL TDGLRAEREQ GITIDVAYRY FSTAKRKFII ADTPGHEQYT
RNMATGASTS DLAIVLIDAR KGVLVQSRRH LYIAALLGIP RVVATINKMD LVDFSPEVFA
AHSLELKRLG DGLGIPSLVT IPISALDGDN VVETSARTPW YDGPSLLQFL ETVPVENASE
VAFRLPVQRV LRPHQEYRGF AGQIAAGAVR PGDQVVVLPS GRSSRVRSIT TFDGDLPSAE
APLSIALTLE DEVDVSRGDV IAAADAPPTV AKRFEASLVW MDSAEFRPAK RYLLKHTAQV
VPTTAITIKN RVNVQTFETE QSFVLHMNDI AIVEIETKRP LVGDSYRDNR TTGSFILIDP
ETNSTVAAGM IRSFVVNPSL SKHHVPIVEL RKGAVVDAVE QKLFDRGYLV VRTKLDPDRL
LQRLHLPRLV LLLEGPDVHY SRIRSFSRQT RLSVEYRVET SATPDSIVEK IVALLTEEKI
AHE
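
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, together with a GC-fraction calculation and a start/stop codon check. The helper names are hypothetical, and the start codon set (ATG, GTG, TTG) is an assumption covering only the common bacterial initiators; the stop codons TAA, TAG, and TGA are those of translation table 11.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def has_start_and_stop(
    seq: str,
    starts=("ATG", "GTG", "TTG"),   # assumed: common bacterial start codons
    stops=("TAA", "TAG", "TGA"),    # stop codons in translation table 11
) -> bool:
    """Check that a coding sequence begins with a start codon and ends with a stop codon."""
    return seq[:3] in starts and seq[-3:] in stops


# Usage with the first two and last two blocks of the gene sequence above.
fragment = clean_sequence("ATGGCTACAG TGTTCACTCT\nGCTCATGAGT AG")
print(fragment, has_start_and_stop(fragment))
```

Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content listed in the record.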