Gene Cfla_1735 details

Gene Information

Locus tag: Cfla_1735
Symbol:
ID: 9145624
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 1929610
End bp: 1931040
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: sulfate adenylyltransferase, large subunit
Protein accession: YP_003636831
Protein GI: 296129581
COG category:
COG ID:
TIGRFAM ID:
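
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1929610
end_bp = 1931040
gene_length_bp = 1431
protein_length_aa = 476


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(cds_length: int) -> int:
    """Amino-acid codons in a CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# Both checks pass for this record under the assumptions stated above.
assert span_length(start_bp, end_bp) == gene_length_bp
assert coding_codons(gene_length_bp) == protein_length_aa
```

Under these assumptions, 1931040 - 1929610 + 1 gives 1431 bp, and 1431 / 3 - 1 gives 476 residues, which agrees with the stated gene and protein lengths.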


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.218793
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.135983
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGCACGC ACGCACCCGA GCTCGTCGCA CCGACCGACG ACCACACGCA GCGCGACCTG 
CTGCGTCTGG CGACCGCCGG GTCGGTCGAC GACGGCAAGA GCACGCTCAT CGGCCGTCTG
CTGTACGACA CGAAGTCGGT GCTCGCGGAC CAGCTCTCGG CCGTGGAGCG CGCGACCGCG
GCACGCGGCG GCGACGGCAC GGGCGTCGAC CTGGCGCTGC TCACCGACGG CCTGCGCGCC
GAGCGCGAGC AGGGCATCAC CATCGACGTC GCCTACCGCT ACTTCTCGAC CGCCCGCCGG
GCGTTCGTGC TCGCGGACAC GCCCGGCCAC GTGCAGTACA CGCGCAACAT GGTCACCGGC
GCGTCCACGG CCGAGCTCGC GATCGTGCTG GTCGACGCAC GCAAGGGCGT GCTCGAGCAG
ACGCGCCGGC ACGCGGCCCT GACCGCGCTG CTCGGCGTGC CGCACGTCGT GCTGGCGGTC
AACAAGATGG ACCTCGTCGG CTTCGACGAG GCCACCTTCC GGTCGATCGC CGCCGAGTTC
ACCGACTACG CACGCGTCCT GGGCCTGCCG GACGTGACGT GCGTGCCGCT GTCGGCGCTC
GACGGCGACA ACGTCGTCGA CCGCTCCCCG CGCGCACCCT GGTACGACGG GCCGACGCTG
CTCGAGCTCC TCGAGGCCGT GCCCGTCGCC CGCGACGCCA CGGCCGAGCC GCTGCGCCTG
CCCGTGCAGT ACGTCATCCG CCCGCGCACA CCGGAGCACC CCGACTACCG CGGGTACGCG
GGCAAGGTCG CCTCGGGTGT CGTGCGCGTC GGCGACGAGG TGCGCGTCCT GCCGTCGGGG
CGCTCCAGCC GCGTGGTCGG CATCGACACG TTCGACGGCC CGCTCGCCGC GGCCGAGGCC
CCGCGCTCGG TGACGGTGCG GCTCGCGGAC GACCTCGACG TGGCGCGCGG CGACGTGCTC
GTGCCTGCGG GCGAGCAGGT CACGACGGGC CAGGACCTCG TCGGCACGGT CTGCTGGCTC
ACCGAGCGGC GATCGGTCAG CGGGGCCCGC GTGCTCGTGC GCGTGGGGAC CCGCACGGTG
CGCGGCCTGC TGCGGGAGGT CGACGCGCGC CTCGACGTCG ACACGCTGAC CGTCGAGCAG
TGGGACCCGG TCGACACGGT GCAGACGATC GACGCGACGG ACACCTCGAC GGAGGCCGCG
CCGCGCTCGC TCGGCCTCAA CGCGATCGGA CGGCTGCGGC TGCGGCTCGC CGAGCCGGTC
GTCCTCGACG ACTACGCGAC GCACCGGCGC ACGGGCGGTT TCCTCGTCGT CGACCCGGCG
GACGGCAGCA CGCTCGCGGC CGGGCTGATC GGTCCCACGC TGCTCGACCG CCTGGTGCCG
GTGCGCGCCG ACGAGCGCGA CGACGACTGG CTCGCCGGGG CCGGGATATG A
 
Protein sequence
MSTHAPELVA PTDDHTQRDL LRLATAGSVD DGKSTLIGRL LYDTKSVLAD QLSAVERATA 
ARGGDGTGVD LALLTDGLRA EREQGITIDV AYRYFSTARR AFVLADTPGH VQYTRNMVTG
ASTAELAIVL VDARKGVLEQ TRRHAALTAL LGVPHVVLAV NKMDLVGFDE ATFRSIAAEF
TDYARVLGLP DVTCVPLSAL DGDNVVDRSP RAPWYDGPTL LELLEAVPVA RDATAEPLRL
PVQYVIRPRT PEHPDYRGYA GKVASGVVRV GDEVRVLPSG RSSRVVGIDT FDGPLAAAEA
PRSVTVRLAD DLDVARGDVL VPAGEQVTTG QDLVGTVCWL TERRSVSGAR VLVRVGTRTV
RGLLREVDAR LDVDTLTVEQ WDPVDTVQTI DATDTSTEAA PRSLGLNAIG RLRLRLAEPV
VLDDYATHRR TGGFLVVDPA DGSTLAAGLI GPTLLDRLVP VRADERDDDW LAGAGI