Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_1846 |
Symbol | |
ID | 9145739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 2057587 |
End bp | 2060922 |
Gene Length | 3336 bp |
Protein Length | 1111 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | carbamoyl-phosphate synthase, large subunit |
Protein accession | YP_003636942 |
Protein GI | 296129692 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.422098 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCGCC GCGACGACCT GAAGTCCGTC CTCGTCATCG GCTCCGGCCC GATCGTCATC GGGCAGGCCT GCGAGTTCGA CTACTCCGGC ACGCAGGCGT GCCGCGTGCT GAAGGAGGAG GGCCTGCGGG TCGTCCTCGT GAACTCGAAC CCCGCCACGA TCATGACCGA CCCGGAGTTC GCCGACGCGA CCTACGTCGA GCCGATCACG ACCGAGGTCC TCACGTCGAT CATCGCCAAG GAGCGGCCCG ACGCGCTGCT GCCGACGCTC GGCGGCCAGA CCGCCCTCAA CGCGGCGATC GCGCTCGACG AGGCCGGTGT CCTGGAGAAG TACGGCGTCG AGCTCATCGG CGCGAACATC GCTGCCATCC AGAAGGGCGA GGACCGCCAG GCGTTCAAGG ACGTCGTCGA GGTCGCGGGT GGCGAGTCCG CCCGCTCCGC GATCATCCAC ACGGTCGACG AGGCGCTCGT CGCCGCCGAG GACCTCGGGT ACCCGATGGT CGTGCGGCCG TCGTTCACCA TGGGCGGCCT CGGCTCGGGC CTCGCGTACG ACGAGGACGA CCTGCGCCGG ATCGTCGGGC AGGGCCTGCA CTACTCGCCG ACCACCGAGG TGCTCCTCGA GGAGTCGATC CTCGGCTGGA AGGAGTACGA GCTCGAGCTC ATGCGCGACA AGCACGACAA CGTCGTGGTC GTGTGCTCGA TCGAGAACGT CGACCCCGTC GGTGTGCACA CCGGCGACTC GGTCACGGTG GCGCCGGCGC TCACGCTCAC GGACCGCGAG TACCAGCGGC TGCGCGACAT CAGCATCGCG GTCATCCGTG AGGTCGGGGT GGACACCGGT GGCTGCAACA TCCAGTTCGC GGTGCACCCC GACACCGGCC GGGTCATCGT CATCGAGATG AACCCGCGCG TGTCGCGCTC GTCGGCGCTC GCGTCGAAGG CGACCGGCTT CCCGATCGCG AAGATCGCCG CCAAGCTCGC CATCGGCTAC ACGCTCGACG AGATCCCCAA CGACATCACG CGCTCGACGC CCGCGTCGTT CGAGCCGACC CTCGACTACG TCGTGGTCAA GGTCCCGCGG TTCGCGTTCG AGAAGTTCCC TGCGGCCGAC GACACGCTGA CGACGACCAT GAAGTCGGTC GGCGAGGCGA TGGCGCTGGG CCGCAACTTC ACCGAGGCGC TCGGCAAGGC GATGCGCTCG ATCGACAAGA AGGGCTCGAC GTTCCACTGG GACGGCGAGC CGGCCACGGG GGAGGAGCTC GAGCGGCTCG TCGCGTCGAT CTCGCGTCCC ACGGAGCACC GGCTCGTCGA CGTGCAGCAG GTGCTGCGCG CGGGGGTCCC CGTCGACGAC GTGTACGCCC GTACCGGCAT CGACCCGTGG TTCCTCGACC AGGTCCAGCT CGTCAACGAG GTCGCGAGGG CCACGGCCGA GGCGCCGGCG CTCACGGCGG ACGTCCTCGA GCAGGCCAAG CGGCACGGGT TGTCGGACGT GCAGGTCGCC GCCCTGCGGC AGACCAGCGA GGACGCCGTC CGGCGCACGC GCTGGGCGCT GGGCGTCCGA CCGGTGTACA AGACCGTCGA CACGTGCGCG GCCGAGTTCG CGGCCCGAAC GCCGTACCAC TACTCGTCGT ACGACGAGGA GAGCGAGGTC CAGCCGCGCC CGCGGCCGGC CATCCTCATC CTGGGCTCCG GGCCCAACCG GATCGGCCAG GGCATCGAGT TCGACTACTC GTGCGTGCAC GCCGCGCTGG CGCTCAAGGG CGAGTACGAG ACCGTCATGG TCAACTGCAA CCCCGAGACG GTGTCGACCG ACTACGACAC GGCCGACCGC CTGTACTTCG AACCGCTGAC GTTCGAGGAC GTCCTCGAGG TGTACGAGGC GGAGAAGGCC GCCGGCCCCG TGGCCGGCCT CATCGTGACG CTCGGCGGCC AGACGCCGCT GTCGCTCGCG CAGCGGCTGT CGGACGCGGG CCTGCCGATC CTCGGAACGC CGCCGGCGGC CATCGACGCC GCGGAGGACC GTGGCGAGTT CGGTGCCGTG CTGGCGGCCG CCGGTCTCCC GGCGCCGGCG TTCGGCACGG CGACGACCCT GGAGGGGGCG CGCGAGACGG CCCGTCGCAT CGGGTTCCCG GTGCTGGTCC GTCCGTCGTA CGTGCTGGGC GGGCGCGGGA TGGAGATCGT GTACGACGAG CACCAGCTCA CCGAGTACGT CGAGCGCGCG ATCCACGAGC AGCTGGGTGG GGACCGCGGG GGCAGCCTGC CCCCGCTGCT CATCGACCGC TTCCTCGACG ACGCGATCGA GATCGACGTC GACGCGCTGT ACGACGGCAC CGAGCTGTTC CTCGGTGGCG TCATGGAGCA CATCGAGGAG GCCGGCGTGC ACTCGGGCGA CTCCGCGTGC GTGCTGCCCC CGGTGACGCT GTCGGTCGCC GAGCTCGCGC GCATCCGGGA GTCGACCGAG GCGATCGCGC GCGGCGTGGG CGTGCGCGGG CTGCTCAACA TCCAGTTCGC CCTGGTGTCG GACGTGCTGT ACGTGCTCGA GGCGAACCCG CGCGCGTCCC GCACGGCGCC GTTCGTCTCC AAGGCCACGG GCGTGTCGCT CGCCAAGGCC GCGGCGCTCG TGATGGCCGG CCGGACGATC GCCGAGCTGC GGGCGTCGGG CCTGCTGCCC GCCCAGGACG CGAGCGTGCT CGACCTCGAC GCGCCGCTCG CGGTCAAGGA GGCCGTGCTG CCCTTCAAGC GGTTCCGCAC GGCGGACGGC ACGGTCGTCG ACACGGTCCT GGGCCCGGAG ATGCGCTCGA CGGGTGAGGT CATGGGCTTC GACGTCGACT TCCCGACGGC GTTCGCGAAG TCGCAGGCGG CGGCCTTCGG TGGGCTGCCG ACGAGCGGGC GGGTGTTCAT CTCGGTCGCG GACCGCGACA AGCGGTCGAT CGTGCTCCCG GTGAAGCGCC TGGTGGAGCT CGGGTTCGAG ATCCTCGCCA CCGAGGGCAC GGCCGCCGTG CTCCGGCGCA GCGGCATCGT GTCGCGGATC GTGCGCAAGC ACTCGGCGGG GCGCGGACCG GACGGCGAGC CGACGGTCGT CGACCTCATC TCCGCCGGGG AGGTGGACAT GGTCGTCAAC ACGCCCTCGG GGCAGGGCTC GCGTGCCGAC GGGTACGAGA TCCGCGCCGC CACGACGGCG GCGGACAAGG CGATCGTCAC GACGGTGCAG CAGCTCGGCG CCGCGGTGCA GGCCATCGAG GCGCGCCAGG CGGGTCCGTT CAGCGTCACG AGCCTGCAGG AGCACGACGC CGCGGCGGCG TCGCGACGTG CGGCCCTCGC GGAGGTGGGT GCGTGA
|
Protein sequence | MPRRDDLKSV LVIGSGPIVI GQACEFDYSG TQACRVLKEE GLRVVLVNSN PATIMTDPEF ADATYVEPIT TEVLTSIIAK ERPDALLPTL GGQTALNAAI ALDEAGVLEK YGVELIGANI AAIQKGEDRQ AFKDVVEVAG GESARSAIIH TVDEALVAAE DLGYPMVVRP SFTMGGLGSG LAYDEDDLRR IVGQGLHYSP TTEVLLEESI LGWKEYELEL MRDKHDNVVV VCSIENVDPV GVHTGDSVTV APALTLTDRE YQRLRDISIA VIREVGVDTG GCNIQFAVHP DTGRVIVIEM NPRVSRSSAL ASKATGFPIA KIAAKLAIGY TLDEIPNDIT RSTPASFEPT LDYVVVKVPR FAFEKFPAAD DTLTTTMKSV GEAMALGRNF TEALGKAMRS IDKKGSTFHW DGEPATGEEL ERLVASISRP TEHRLVDVQQ VLRAGVPVDD VYARTGIDPW FLDQVQLVNE VARATAEAPA LTADVLEQAK RHGLSDVQVA ALRQTSEDAV RRTRWALGVR PVYKTVDTCA AEFAARTPYH YSSYDEESEV QPRPRPAILI LGSGPNRIGQ GIEFDYSCVH AALALKGEYE TVMVNCNPET VSTDYDTADR LYFEPLTFED VLEVYEAEKA AGPVAGLIVT LGGQTPLSLA QRLSDAGLPI LGTPPAAIDA AEDRGEFGAV LAAAGLPAPA FGTATTLEGA RETARRIGFP VLVRPSYVLG GRGMEIVYDE HQLTEYVERA IHEQLGGDRG GSLPPLLIDR FLDDAIEIDV DALYDGTELF LGGVMEHIEE AGVHSGDSAC VLPPVTLSVA ELARIRESTE AIARGVGVRG LLNIQFALVS DVLYVLEANP RASRTAPFVS KATGVSLAKA AALVMAGRTI AELRASGLLP AQDASVLDLD APLAVKEAVL PFKRFRTADG TVVDTVLGPE MRSTGEVMGF DVDFPTAFAK SQAAAFGGLP TSGRVFISVA DRDKRSIVLP VKRLVELGFE ILATEGTAAV LRRSGIVSRI VRKHSAGRGP DGEPTVVDLI SAGEVDMVVN TPSGQGSRAD GYEIRAATTA ADKAIVTTVQ QLGAAVQAIE ARQAGPFSVT SLQEHDAAAA SRRAALAEVG A
|
| |