Gene Cfla_1846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1846 
Symbol 
ID9145739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2057587 
End bp2060922 
Gene Length3336 bp 
Protein Length1111 aa 
Translation table11 
GC content71% 
IMG OID 
Productcarbamoyl-phosphate synthase, large subunit 
Protein accessionYP_003636942 
Protein GI296129692 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.422098 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTCGCC GCGACGACCT GAAGTCCGTC CTCGTCATCG GCTCCGGCCC GATCGTCATC 
GGGCAGGCCT GCGAGTTCGA CTACTCCGGC ACGCAGGCGT GCCGCGTGCT GAAGGAGGAG
GGCCTGCGGG TCGTCCTCGT GAACTCGAAC CCCGCCACGA TCATGACCGA CCCGGAGTTC
GCCGACGCGA CCTACGTCGA GCCGATCACG ACCGAGGTCC TCACGTCGAT CATCGCCAAG
GAGCGGCCCG ACGCGCTGCT GCCGACGCTC GGCGGCCAGA CCGCCCTCAA CGCGGCGATC
GCGCTCGACG AGGCCGGTGT CCTGGAGAAG TACGGCGTCG AGCTCATCGG CGCGAACATC
GCTGCCATCC AGAAGGGCGA GGACCGCCAG GCGTTCAAGG ACGTCGTCGA GGTCGCGGGT
GGCGAGTCCG CCCGCTCCGC GATCATCCAC ACGGTCGACG AGGCGCTCGT CGCCGCCGAG
GACCTCGGGT ACCCGATGGT CGTGCGGCCG TCGTTCACCA TGGGCGGCCT CGGCTCGGGC
CTCGCGTACG ACGAGGACGA CCTGCGCCGG ATCGTCGGGC AGGGCCTGCA CTACTCGCCG
ACCACCGAGG TGCTCCTCGA GGAGTCGATC CTCGGCTGGA AGGAGTACGA GCTCGAGCTC
ATGCGCGACA AGCACGACAA CGTCGTGGTC GTGTGCTCGA TCGAGAACGT CGACCCCGTC
GGTGTGCACA CCGGCGACTC GGTCACGGTG GCGCCGGCGC TCACGCTCAC GGACCGCGAG
TACCAGCGGC TGCGCGACAT CAGCATCGCG GTCATCCGTG AGGTCGGGGT GGACACCGGT
GGCTGCAACA TCCAGTTCGC GGTGCACCCC GACACCGGCC GGGTCATCGT CATCGAGATG
AACCCGCGCG TGTCGCGCTC GTCGGCGCTC GCGTCGAAGG CGACCGGCTT CCCGATCGCG
AAGATCGCCG CCAAGCTCGC CATCGGCTAC ACGCTCGACG AGATCCCCAA CGACATCACG
CGCTCGACGC CCGCGTCGTT CGAGCCGACC CTCGACTACG TCGTGGTCAA GGTCCCGCGG
TTCGCGTTCG AGAAGTTCCC TGCGGCCGAC GACACGCTGA CGACGACCAT GAAGTCGGTC
GGCGAGGCGA TGGCGCTGGG CCGCAACTTC ACCGAGGCGC TCGGCAAGGC GATGCGCTCG
ATCGACAAGA AGGGCTCGAC GTTCCACTGG GACGGCGAGC CGGCCACGGG GGAGGAGCTC
GAGCGGCTCG TCGCGTCGAT CTCGCGTCCC ACGGAGCACC GGCTCGTCGA CGTGCAGCAG
GTGCTGCGCG CGGGGGTCCC CGTCGACGAC GTGTACGCCC GTACCGGCAT CGACCCGTGG
TTCCTCGACC AGGTCCAGCT CGTCAACGAG GTCGCGAGGG CCACGGCCGA GGCGCCGGCG
CTCACGGCGG ACGTCCTCGA GCAGGCCAAG CGGCACGGGT TGTCGGACGT GCAGGTCGCC
GCCCTGCGGC AGACCAGCGA GGACGCCGTC CGGCGCACGC GCTGGGCGCT GGGCGTCCGA
CCGGTGTACA AGACCGTCGA CACGTGCGCG GCCGAGTTCG CGGCCCGAAC GCCGTACCAC
TACTCGTCGT ACGACGAGGA GAGCGAGGTC CAGCCGCGCC CGCGGCCGGC CATCCTCATC
CTGGGCTCCG GGCCCAACCG GATCGGCCAG GGCATCGAGT TCGACTACTC GTGCGTGCAC
GCCGCGCTGG CGCTCAAGGG CGAGTACGAG ACCGTCATGG TCAACTGCAA CCCCGAGACG
GTGTCGACCG ACTACGACAC GGCCGACCGC CTGTACTTCG AACCGCTGAC GTTCGAGGAC
GTCCTCGAGG TGTACGAGGC GGAGAAGGCC GCCGGCCCCG TGGCCGGCCT CATCGTGACG
CTCGGCGGCC AGACGCCGCT GTCGCTCGCG CAGCGGCTGT CGGACGCGGG CCTGCCGATC
CTCGGAACGC CGCCGGCGGC CATCGACGCC GCGGAGGACC GTGGCGAGTT CGGTGCCGTG
CTGGCGGCCG CCGGTCTCCC GGCGCCGGCG TTCGGCACGG CGACGACCCT GGAGGGGGCG
CGCGAGACGG CCCGTCGCAT CGGGTTCCCG GTGCTGGTCC GTCCGTCGTA CGTGCTGGGC
GGGCGCGGGA TGGAGATCGT GTACGACGAG CACCAGCTCA CCGAGTACGT CGAGCGCGCG
ATCCACGAGC AGCTGGGTGG GGACCGCGGG GGCAGCCTGC CCCCGCTGCT CATCGACCGC
TTCCTCGACG ACGCGATCGA GATCGACGTC GACGCGCTGT ACGACGGCAC CGAGCTGTTC
CTCGGTGGCG TCATGGAGCA CATCGAGGAG GCCGGCGTGC ACTCGGGCGA CTCCGCGTGC
GTGCTGCCCC CGGTGACGCT GTCGGTCGCC GAGCTCGCGC GCATCCGGGA GTCGACCGAG
GCGATCGCGC GCGGCGTGGG CGTGCGCGGG CTGCTCAACA TCCAGTTCGC CCTGGTGTCG
GACGTGCTGT ACGTGCTCGA GGCGAACCCG CGCGCGTCCC GCACGGCGCC GTTCGTCTCC
AAGGCCACGG GCGTGTCGCT CGCCAAGGCC GCGGCGCTCG TGATGGCCGG CCGGACGATC
GCCGAGCTGC GGGCGTCGGG CCTGCTGCCC GCCCAGGACG CGAGCGTGCT CGACCTCGAC
GCGCCGCTCG CGGTCAAGGA GGCCGTGCTG CCCTTCAAGC GGTTCCGCAC GGCGGACGGC
ACGGTCGTCG ACACGGTCCT GGGCCCGGAG ATGCGCTCGA CGGGTGAGGT CATGGGCTTC
GACGTCGACT TCCCGACGGC GTTCGCGAAG TCGCAGGCGG CGGCCTTCGG TGGGCTGCCG
ACGAGCGGGC GGGTGTTCAT CTCGGTCGCG GACCGCGACA AGCGGTCGAT CGTGCTCCCG
GTGAAGCGCC TGGTGGAGCT CGGGTTCGAG ATCCTCGCCA CCGAGGGCAC GGCCGCCGTG
CTCCGGCGCA GCGGCATCGT GTCGCGGATC GTGCGCAAGC ACTCGGCGGG GCGCGGACCG
GACGGCGAGC CGACGGTCGT CGACCTCATC TCCGCCGGGG AGGTGGACAT GGTCGTCAAC
ACGCCCTCGG GGCAGGGCTC GCGTGCCGAC GGGTACGAGA TCCGCGCCGC CACGACGGCG
GCGGACAAGG CGATCGTCAC GACGGTGCAG CAGCTCGGCG CCGCGGTGCA GGCCATCGAG
GCGCGCCAGG CGGGTCCGTT CAGCGTCACG AGCCTGCAGG AGCACGACGC CGCGGCGGCG
TCGCGACGTG CGGCCCTCGC GGAGGTGGGT GCGTGA
 
Protein sequence
MPRRDDLKSV LVIGSGPIVI GQACEFDYSG TQACRVLKEE GLRVVLVNSN PATIMTDPEF 
ADATYVEPIT TEVLTSIIAK ERPDALLPTL GGQTALNAAI ALDEAGVLEK YGVELIGANI
AAIQKGEDRQ AFKDVVEVAG GESARSAIIH TVDEALVAAE DLGYPMVVRP SFTMGGLGSG
LAYDEDDLRR IVGQGLHYSP TTEVLLEESI LGWKEYELEL MRDKHDNVVV VCSIENVDPV
GVHTGDSVTV APALTLTDRE YQRLRDISIA VIREVGVDTG GCNIQFAVHP DTGRVIVIEM
NPRVSRSSAL ASKATGFPIA KIAAKLAIGY TLDEIPNDIT RSTPASFEPT LDYVVVKVPR
FAFEKFPAAD DTLTTTMKSV GEAMALGRNF TEALGKAMRS IDKKGSTFHW DGEPATGEEL
ERLVASISRP TEHRLVDVQQ VLRAGVPVDD VYARTGIDPW FLDQVQLVNE VARATAEAPA
LTADVLEQAK RHGLSDVQVA ALRQTSEDAV RRTRWALGVR PVYKTVDTCA AEFAARTPYH
YSSYDEESEV QPRPRPAILI LGSGPNRIGQ GIEFDYSCVH AALALKGEYE TVMVNCNPET
VSTDYDTADR LYFEPLTFED VLEVYEAEKA AGPVAGLIVT LGGQTPLSLA QRLSDAGLPI
LGTPPAAIDA AEDRGEFGAV LAAAGLPAPA FGTATTLEGA RETARRIGFP VLVRPSYVLG
GRGMEIVYDE HQLTEYVERA IHEQLGGDRG GSLPPLLIDR FLDDAIEIDV DALYDGTELF
LGGVMEHIEE AGVHSGDSAC VLPPVTLSVA ELARIRESTE AIARGVGVRG LLNIQFALVS
DVLYVLEANP RASRTAPFVS KATGVSLAKA AALVMAGRTI AELRASGLLP AQDASVLDLD
APLAVKEAVL PFKRFRTADG TVVDTVLGPE MRSTGEVMGF DVDFPTAFAK SQAAAFGGLP
TSGRVFISVA DRDKRSIVLP VKRLVELGFE ILATEGTAAV LRRSGIVSRI VRKHSAGRGP
DGEPTVVDLI SAGEVDMVVN TPSGQGSRAD GYEIRAATTA ADKAIVTTVQ QLGAAVQAIE
ARQAGPFSVT SLQEHDAAAA SRRAALAEVG A