Gene Caul_1749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1749 
Symbol 
ID5899204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1840240 
End bp1842168 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content67% 
IMG OID641562239 
Productbifunctional sulfate adenylyltransferase subunit 1/adenylylsulfate kinase protein 
Protein accessionYP_001683376 
Protein GI167645713 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0529] Adenylylsulfate kinase and related kinases
[COG2895] GTPases - Sulfate adenylate transferase subunit 1 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00455] adenylylsulfate kinase (apsK)
[TIGR02034] sulfate adenylyltransferase, large subunit 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCT CCGCCCCGAG CACGATCTAC AAGCCCTCCG ACCTGATCGC CGACGACATC 
GACGCCTATC TGGTGGCCCA CCAGCACAAG TCGCTGCTGC GCTTCATCAC CTGTGGGTCG
GTGGACGACG GCAAATCGAC CCTGATCGGC CGGCTGCTCT ACGACAGCAA GATGATCTTC
GAGGATCAGA TGGCGGCCCT CGAAGCCGAC TCCAAGCGGG TCGGCACCCA AGGCGGGGCG
ATCGACTTCG CCCTGCTGGT GGACGGTCTG GCCGCCGAGC GCGAGCAAGG CATCACCATC
GACGTAGCCT ACCGCTTCTT CGCCACCGAC AGGCGCAAGT TCATCGTCGC CGACACCCCC
GGCCACGAGC AATATACCCG CAACATGGTC ACCGGCGCCT CGACCGCCGA TGCGGCCGTG
ATCCTGATCG ACGCCCGCAA GGGCGTGCTG ACCCAGACCC GCCGCCATTC GTATCTGGTC
CAACTGCTGG GCATTCGCCA TGTGGTGCTG GCGGTGAACA AGATGGACCT GGTGGGCTGG
GACCAGGCGG TGTTCGACCG CATCGTCGCC GACTACCGCG CCTTCGCGGG CCAAATCGGC
ATGGAGGCTT TCACCCCGAT CCCGATCTCG GGCCTGACCG GCGCCAACAT GGCCTCGCGC
GGCGAGGACT CGCCGTGGTT CGACGGCCCG ATCCTGATGG ACTGGCTGGA AGGCGTCGAG
GTCGAGGACG ACCTGCGCAG CCAGTCGTTC CGCATGCCCG TGCAGTGGGT CAATCGCCCG
AACCTGGACT TCCGAGGCTT CTCCGGCCAG ATCGCCGCCG GGACGGTCAA GCCGGGCGAT
CGGGTCAAGT CGCTGCCCTC CGGCCGGGAA AGCACCGTGG CGCGGATCGT CACCCTCCCC
GATGACCTCC CCGAGGCCTA TGCCGGCCAA TCGGTGACGA TCACCCTGGC CGACGAGATC
GACGTCAGCC GCGGCGACAT CCTGGTGGCG GCCGACGACC CGGTCGCCGT GGCCGGCCAG
TTCGAGGCCA CCGTCGTCTG GATGGATGAC GAGCCCCTGC CCCCGGGCCG CTCCTACCTG
CTGAAGATCG GCGCGCGGAC GGTCGGGGCC AGCGTCACCG AGATCAAGCA CCGGGTGAAC
GTCAACACGC TGGAGCACCT GGCGGCCAAG CGGCTGGAGC TGAACGAGAT CGGCCTGGTC
AACCTGTCTC TGGACCAGGC CATTCCGTTC GAGCCCTACG CCAAGAACCG CGACCTGGGC
GGCTTCATCC TGATCGACAG GATCAGCAAT CGCACCGTCG GGGCTGGCCT GCTGAACTTT
GCCCTGCGCC GGGCCGACAA CATCCACTGG CAGCACACCG ACGTCAGCAA GGCCTCGCGA
GCGGCGCTGA AGGGCCAGCG CGGCCGGGTG GTCTGGCTGA CGGGCCTGTC GGGCGCCGGC
AAGTCGACGA TCGCCAACCT GGTCGAGAAG CGCCTGCACG CCCTTGGCCG CCACACCTAT
CTGCTGGACG GCGACAATGT GCGCCACGGG CTCAACAAGA ATCTCGGCTT CACCGAGGAG
GACCGGGTCG AGAATATCCG CCGGGTGGCC GAGGTCGCCA AGCTGATGGT CGACGCCGGG
CTGATCGTGC TGACCGCCTT CATCTCGCCG TTCCGCGCCG AGCGCCGCCT GGCGCGGGAG
ATCCTGCGGG ACGGCGAGTT CGTCGAGGTC TTCGTCGACA CCCCGCTGGC GGTGGCCGAG
CAGCGCGACG TCAAGGGCCT CTACAAGAAA GCGCGGTCGG GCCAGTTGAA GAACTTCACC
GGCATCGACA GTCCTTATGA AGCGCCCGAA GCGCCGGAAC TGCGGATCGA CACCACGAAA
ATGGACCCCG TCGCCGCCGC CGAGCGGATC GTCGCCTGGC TGGAAGGCGA GCTGGACTAC
GAGATCTAG
 
Protein sequence
MTASAPSTIY KPSDLIADDI DAYLVAHQHK SLLRFITCGS VDDGKSTLIG RLLYDSKMIF 
EDQMAALEAD SKRVGTQGGA IDFALLVDGL AAEREQGITI DVAYRFFATD RRKFIVADTP
GHEQYTRNMV TGASTADAAV ILIDARKGVL TQTRRHSYLV QLLGIRHVVL AVNKMDLVGW
DQAVFDRIVA DYRAFAGQIG MEAFTPIPIS GLTGANMASR GEDSPWFDGP ILMDWLEGVE
VEDDLRSQSF RMPVQWVNRP NLDFRGFSGQ IAAGTVKPGD RVKSLPSGRE STVARIVTLP
DDLPEAYAGQ SVTITLADEI DVSRGDILVA ADDPVAVAGQ FEATVVWMDD EPLPPGRSYL
LKIGARTVGA SVTEIKHRVN VNTLEHLAAK RLELNEIGLV NLSLDQAIPF EPYAKNRDLG
GFILIDRISN RTVGAGLLNF ALRRADNIHW QHTDVSKASR AALKGQRGRV VWLTGLSGAG
KSTIANLVEK RLHALGRHTY LLDGDNVRHG LNKNLGFTEE DRVENIRRVA EVAKLMVDAG
LIVLTAFISP FRAERRLARE ILRDGEFVEV FVDTPLAVAE QRDVKGLYKK ARSGQLKNFT
GIDSPYEAPE APELRIDTTK MDPVAAAERI VAWLEGELDY EI