Gene Cfla_0035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_0035 
Symbol 
ID9143900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp41751 
End bp44126 
Gene Length2376 bp 
Protein Length791 aa 
Translation table11 
GC content70% 
IMG OID 
Productguanosine pentaphosphate synthetase I/polyribonucleotide nucleotidyltransferase 
Protein accessionYP_003635154 
Protein GI296127904 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGCG CCGACCTATT CGCAGGATCA CGGATAGAAC TCCTCTCGAC GGGCCCATCG 
CAGGTCTCTG AGGCCGAGTC CGAAAGAATT CGCGAAGAAG TCGCCCAAGC ATTGCGTGCG
CACGTCCCAG TCCAGTTCGC CGAGGCCACG ATCGACAACG GTCGCTTCGG CACCCGCACC
GTCCGCTTCG AGACGGGCCG CCTGGCCAAG CAGGCCGCAG GGTCGGCCGT CGCCTACCTC
GACGACGACA CCATGCTGCT GTCGGCCACG ACGGCCGGTA AGCACCCCCG TGAGGCGTTC
GACTTCTTCC CCCTGACGGT CGACGTCGAG GAGCGGCAGT ACGCCGCCGG CAAGATCCCC
GGCGCGTTCT TCCGCCGCGA GGGCCGCCCC TCGACCGACG CGATCCTCGC CTGCCGCCTG
ATCGACCGCC CGCTGCGCCC CCTGTTCGTC AAGGGCCTGC GCAACGAGGT CCAGGTCGTC
GTGACGGTGC TGTCGATCAA CCCCGACGAC GCGTACGACG TGCTCGCCAT CAACGCCGCG
TCGATCTCGA CACAGCTGTC CGGCCTGCCG TTCTCCGGCC CGGTGGCCGC GACGCGTCTC
GCGCTCGTCG ACGGCCAGTG GGTCGCGTTC CCGCGCTACT CCGAGCGCGA GCGCTCGACG
TTCGACATCG TCGTGGCCGG CCGCGTGGTC GGGGATGACG TCGCGATCGC CATGATCGAG
GCCGACGCCC CCGAGAACGC CTGGAACCTC ATCCACGGCG GCGGCGGCAC CGCGCCGACC
GAGGAGGTCG TGGCGCAGGG CCTCGAGGCG TCGAAGCCGT TCATCAAGGT CCTCGTCGAG
GCGCAGCAGC AGCTCGCGGC CCAGGCCGCG AAGGAGACGC AGACCTTCCC GACGTTCCCG
GACTACCAGC CCGACGCGTT CGCCGCCGTC GAGCAGGCCG CGTCCGAGCG CCTGTCCGCC
GCGCTGCAGA TCGCCGGCAA GCAGGAGCGC GAGAACCGCC TGGACGAGAT CAAGGGCGAG
GTCGCCGGCG AGCTCGCGGC GCAGTTCGAG GGCCGCGAGA AGGAGATCTC CGCCGCGTAC
CGCTCGCTGC AGAAGCAGCT CATCCGCCAG CGGATCCTCA CGGACGGCTT CCGGATCGAC
GGTCGCGGCC TGCGTGACAT CCGGACGCTC TCGGCCGAGG TCGAGGTGCT GCCCCGCACG
CACGGCTCGG CGCTGTTCGA GCGCGGTGAG ACGCAGATCC TCGGCGTCAC GACGCTGAAC
ATGCTGCGGA TGGAGCAGCA GATCGACTCG CTGTCGCCCG AGACGCGCAA GCGGTACATG
CACCACTACA ACTTCCCGCC CTACTCGACC GGTGAGACCG GCCGCGTCGG GTCGCCGAAG
CGCCGCGAGA TCGGCCACGG TGCGCTCGCC GAGCGGGCGA TCGTGCCCGT GCTGCCCGCG
CGCGAGGAGT TCCCGTACGC GATCCGGCAG GTCTCCGAGG CGCTGGGCTC CAACGGCTCG
ACGTCCATGG GCTCCGTCTG CGCCGCGACG CTGTCGCTGC TCAACGCCGG TGTCCCGCTG
CGCGCGCCCG TCGCGGGCAT CGCGATGGGC CTGGTGTCCG ACACGGTCGA CGGTGAGACC
CGCTACGCGG CCCTCACCGA CATCCTGGGC GCCGAGGACG CGTTCGGCGA CATGGACTTC
AAGGTCGCCG GCACGCGCGA GTTCGTCACC GCGATCCAGC TCGACACCAA GCTCGACGGC
ATCCCCGCCT CGGTCCTGGC CGGCGCGCTG ACGCAGGCCA AGGAGGCGCG CCTGGCGATC
CTCGACGTCA TCGCCGAGGC GATCGACGTC CCGGACGAGA TGAGCCCGTT CGCCCCGCGC
GTCATCTCGG TGAAGGTCCC GGTCGACAAG ATCGGCGAGG TCATCGGCCC GAAGGGCAAG
ATGATCAACC AGATCCAGGA GGAGACCGGC GCCGACATCT CCATCGAGGA CGACGGCACG
GTCTACATCG GCGCCACCGA CGGACCGTCG GCGGAGGCCG CGCGGGCCGC GATCAACGCG
ATCGCGAACC CGCACATGCC CGAGATCGGC GAGCGCTTCG TCGGCACCGT CGTCAAGACG
ACGACGTTCG GCGCGTTCAT CTCGCTGTCC CCGGGCAAGG ACGGTCTGCT GCACATCTCG
CAGATCCGCA AGCTCGTCGG CGGCAAGCGC GTCGAGAACG TCGAGGACGT CCTGGCCGTC
GGCCAGAAGG TCCAGGTCGA GATCGGCGAG ATCGACCCGC GCGGCAAGCT GTCGCTGCAC
GCGGTCCTCG ACGAGGCGCA GACCGAGGGC GACCCGGCCG ACCCGCCGGC GGGCGTAGAG
GTCGCGACCG CTACGGCATA CCGAATTAAC TGGTGA
 
Protein sequence
MTRADLFAGS RIELLSTGPS QVSEAESERI REEVAQALRA HVPVQFAEAT IDNGRFGTRT 
VRFETGRLAK QAAGSAVAYL DDDTMLLSAT TAGKHPREAF DFFPLTVDVE ERQYAAGKIP
GAFFRREGRP STDAILACRL IDRPLRPLFV KGLRNEVQVV VTVLSINPDD AYDVLAINAA
SISTQLSGLP FSGPVAATRL ALVDGQWVAF PRYSERERST FDIVVAGRVV GDDVAIAMIE
ADAPENAWNL IHGGGGTAPT EEVVAQGLEA SKPFIKVLVE AQQQLAAQAA KETQTFPTFP
DYQPDAFAAV EQAASERLSA ALQIAGKQER ENRLDEIKGE VAGELAAQFE GREKEISAAY
RSLQKQLIRQ RILTDGFRID GRGLRDIRTL SAEVEVLPRT HGSALFERGE TQILGVTTLN
MLRMEQQIDS LSPETRKRYM HHYNFPPYST GETGRVGSPK RREIGHGALA ERAIVPVLPA
REEFPYAIRQ VSEALGSNGS TSMGSVCAAT LSLLNAGVPL RAPVAGIAMG LVSDTVDGET
RYAALTDILG AEDAFGDMDF KVAGTREFVT AIQLDTKLDG IPASVLAGAL TQAKEARLAI
LDVIAEAIDV PDEMSPFAPR VISVKVPVDK IGEVIGPKGK MINQIQEETG ADISIEDDGT
VYIGATDGPS AEAARAAINA IANPHMPEIG ERFVGTVVKT TTFGAFISLS PGKDGLLHIS
QIRKLVGGKR VENVEDVLAV GQKVQVEIGE IDPRGKLSLH AVLDEAQTEG DPADPPAGVE
VATATAYRIN W