Gene Cfla_1403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1403 
Symbol 
ID9145289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1557144 
End bp1559804 
Gene Length2661 bp 
Protein Length886 aa 
Translation table11 
GC content73% 
IMG OID 
ProducttRNA synthetase valyl/leucyl anticodon-binding protein 
Protein accessionYP_003636500 
Protein GI296129250 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00382029 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACC AGACCCCCCG GCCCGAGACC CCGACGAGCG CACGCGCCGC CCGCCAGGTA 
CCGGACAAGG TGACGGTCGA CGGCCTGGAG GACCGCTGGT CGCGGACGTG GGCCGACGAG
GGCACGTACA CGTTCGACCG CACGGCCGAG CGTGAGCAGG TGTTCTCGAT CGACACGCCG
CCCCCGACCG TGTCGGGCTC GCTGCACGTC GGCCACGTCT TCTCCTACAC GCACACCGAC
GTCGTCGCGC GGTTCCAGCG CATGCGCGGG CGCGAGGTCC TGTACCCCAT GGGATGGGAC
GACAACGGCC TGCCCACCGA GCGCCGCGTG CAGAACTACT ACGGCGTGCG CTGCGACCCG
TCGCTGCCCT ACGTGGAGGG CTTCGTGCCG CCGCACGAGG GCGGCGAGGG CAAGTCGATC
AAGCCGGGCG ACCAGGTGCC CGTGTCGCGG CGCAACTTCA TCGAGCTGTG CGAGCGGCTG
TCCGCCGAGG ACGAGCTGCA GTTCGAGGCG CTGTGGCGCC GCCTGGGCCT GTCCGTCGAC
TGGTCGATGA CCTACCAGAC CATCGGCTCG ACCGCCCGCG CGGTCTCGCA GCGCGCCTTC
CTGCGCAACC TGCAGCGCGG CGAGGCGTAC CAGGCCGAGG CCCCGGGCCT GTGGGACGTG
ACGTTCCAGA CGGCCGTCGC GCAGGCCGAG CTCGAGGCGC GCGACTACCC CGGCGCCTTC
CACAAGGTGG CGTTCCACCG CCCGGACGGC GAGCAGGTCG TCATCGAGAC GACCCGCCCC
GAGCTGCTGC CCGCGTGCGT GGCGCTCATC GCGCACCCCG ACGACGAGCG CTACCAGCAC
CTGTTCGGCA CCACCGTGAC CAGCCCCCTG TTCGGCGTCG AGCTGCCGGT GCTCGCGCAC
CCCGCGGCCG AGCCCGACAA GGGCGCCGGC ATCGCGATGT GCTGCACCTT CGGCGACCTC
ACCGACGTGC AGTGGTGGCG CGAGCTGCGC CTGCCGACGC GTTCGGTGGT GGGCCGCGAC
GGGCGTGTCC TGCGCGACAC CCCCGAGTGG CTGACGACCG ACGAGGGCCG CGCGCGGTAC
GCCGAGCTCG CGGGCAAGAC GACGTTCAGC GCGCGCGAGG CCGTGGTCGC CGGGCTGCGC
GAGACCGGCG ACCTGCTGGG CGAGCCGGTG CCGACGCAGC GCAAGGCGAA CTTCTACGAG
AAGGGCGACA AGCCCCTCGA GATCGTCACG AGCCGGCAGT GGTACATCCG CAACGGGGGG
CGCGACGAGG ACCTGCGCGA GCAGCTCCTG GGCCGCGGGC GCGAGCTGGA GTTCCACCCC
GACTTCATGC GCGTGCGCTA CGAGAACTGG GTCGGTGGGC TCAACGGCGA CTGGCTCGTC
TCGCGCCAGC GGTTCTTCGG CGTGCCGATC CCCGTGTGGT ACCCGCTCGA CGAGCAGGGC
GAGCCGCAGT ACGACGCACC GCTGCTCCCG GGCGAGGCGG CCCTGCCGGT CGACCCGTCG
AGCGACGTGC CGGCGGGTTA CACCGAGGAC CAGCGCGGCG TGCCCGGCGG CTTCGTCGGC
GACCCCGACA TCATGGACAC CTGGGCGACG AGCTCGCTGA CGCCGCAGAT CGTGTGCGGC
TGGCTCGACG ACCCGGACCT GTTCGCGCGC ACGTTCCCCA TGGACCTGCG GCCGCAGGGT
CAGGACATCA TCCGCACCTG GCTGTTCTCG TCGGTCGTGC GGGCGCACCT CGAGTCCGGT
TCGCTGCCGT GGAAGCACGC GGCGATCAGC GGCTGGATCC TCGACCCCGA CCGCAAGAAG
ATGAGCAAGT CCAAGGGCAA CGTCGTCACG CCGCTGGGCC TGCTCGAGGA GCACGGGTCG
GACGCCGTGC GCTACTGGGC CGCCAGCGCG CGCCTGGGCA CCGACGCGGC CTTCGAGGTC
GGCCAGATGA AGATCGGCCG CCGCCTGGCG ATCAAGGTCC TCAACGCCTC GAAGTTCGTG
CTGTCGTTCG GTCCCGCCGA CGAGCCGGTG TCGCTCGACG CCGCAGCCGT CACGCAGCCG
CTGGACCGCG CGATGCTCGC CGGGCTCGCG GAGGTCGTCG AGCAGGCCAC GGCCGCGCTC
GACGCGTACG ACCACACGCG CGCCCTGGAG CTCACGGAGA CGTTCTTCTG GACGTTCTGC
GACGACTACC TCGAGCTCGT GAAGGACCGC GCGTACGGCG CGGGCGCATC GGCGGCCGAC
GTGACGCCGG AGACCGCCTC GGCGCGCGCC GCCCTGGGCC TGGCGCTCGA CACGCTGCTG
CGCCTCTTCG CCCCCGTGCT GCCGTACGCG ACCGAGGAGG TCTGGTCGTG GTGGCGCGAG
GGCACGGTGC ACCGGCAGCC CTGGCCGACG AGCGCCACGC TGCGCGCCGC CGCCGGGGAC
ACCGACCCGG GCCTGGTCGC GGGCGCCGGC GCTGCGCTCG CGGCGCTGCG CAAGGTGAAG
TCCGAGGCGA AGGTCTCGAT GCGCACGCCC GTCGAGCACG CCGTCCTCGA GGTGCCCGCC
GCGCTGCGCG TTGGAGTCGA GGCCGCGCTG GACGACGTCC GCAACGCCGG GCGCGCGACG
GGCACGCTCG AGATCGCCGG CGGCGACGGC GAGACCGTCG TGGCGCGCGA CGCCCGCCTG
GGCGAGGCCG CACCGCGCTG A
 
Protein sequence
MSDQTPRPET PTSARAARQV PDKVTVDGLE DRWSRTWADE GTYTFDRTAE REQVFSIDTP 
PPTVSGSLHV GHVFSYTHTD VVARFQRMRG REVLYPMGWD DNGLPTERRV QNYYGVRCDP
SLPYVEGFVP PHEGGEGKSI KPGDQVPVSR RNFIELCERL SAEDELQFEA LWRRLGLSVD
WSMTYQTIGS TARAVSQRAF LRNLQRGEAY QAEAPGLWDV TFQTAVAQAE LEARDYPGAF
HKVAFHRPDG EQVVIETTRP ELLPACVALI AHPDDERYQH LFGTTVTSPL FGVELPVLAH
PAAEPDKGAG IAMCCTFGDL TDVQWWRELR LPTRSVVGRD GRVLRDTPEW LTTDEGRARY
AELAGKTTFS AREAVVAGLR ETGDLLGEPV PTQRKANFYE KGDKPLEIVT SRQWYIRNGG
RDEDLREQLL GRGRELEFHP DFMRVRYENW VGGLNGDWLV SRQRFFGVPI PVWYPLDEQG
EPQYDAPLLP GEAALPVDPS SDVPAGYTED QRGVPGGFVG DPDIMDTWAT SSLTPQIVCG
WLDDPDLFAR TFPMDLRPQG QDIIRTWLFS SVVRAHLESG SLPWKHAAIS GWILDPDRKK
MSKSKGNVVT PLGLLEEHGS DAVRYWAASA RLGTDAAFEV GQMKIGRRLA IKVLNASKFV
LSFGPADEPV SLDAAAVTQP LDRAMLAGLA EVVEQATAAL DAYDHTRALE LTETFFWTFC
DDYLELVKDR AYGAGASAAD VTPETASARA ALGLALDTLL RLFAPVLPYA TEEVWSWWRE
GTVHRQPWPT SATLRAAAGD TDPGLVAGAG AALAALRKVK SEAKVSMRTP VEHAVLEVPA
ALRVGVEAAL DDVRNAGRAT GTLEIAGGDG ETVVARDARL GEAAPR