Gene Cfla_2081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2081 
Symbol 
ID9145977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2323545 
End bp2325368 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content67% 
IMG OID 
Productcytochrome c oxidase, subunit I 
Protein accessionYP_003637175 
Protein GI296129925 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGCGC ACACCGAGGT CATCCCGGGC CTGTCGCCCC GGCGGCAGAC GCTGGGGCGG 
ACGGTCGTGA AGTGGCTCAC CTCCACCGAC CACAAGACGA TCGGGTACAT GTACCTGATC
ACGTCGTTCG TGTGGTTCGC GATCGGCGGG ATCCTGGCGC TGCTCATCCG CGCCGAGCTG
TTCCAGCCCG GGATGGACCT GTTCCAGTCG AAGGAGCAGT ACAACCAGGC GTTCACGATG
CACGGCACGA TCATGCTGCT GCTCTTCGCG ACGCCGCTGT TCGCGGGCTT CGCGAACATC
ATCATGCCGC TGCAGATCGG CGCCCCGGAC GTGGCGTTCC CGCGCCTCAA CATGTTCGCG
TACTGGCTGT ACCTGTTCGG CGGGCTCATC GCGGCCGCCG GCTTCCTCAC GCCGCAGGGT
GCCGCGTCGT TCGGCTGGTT CGCCTACGCG CCGCTGTCCA ACCAGCTCTA CTCACCGGGT
CTGGGGGGAG ACCTGTGGGT CTTCGGCCTC GCGCTGGGCG GCTTCGGCAC CATCCTCGGG
GCCGTCAACT TCATCACCAC CGTGGTCACG ATGCGTGCGC CCGGCATGAC GATGTTCCGC
ATGCCGATCT TCACCTGGAA CATCCTGGTG ACGTCGCTGC TCGTGCTCAT GGCGTTCCCG
CCGCTGGCTG CGGCGCTGTT CGCGCTCGGC GCCGACCGCC GCCTGGGTGC GCAGGTGTTC
AACCCCGACA ACGGCGGGGC GCTGCTGTGG CAGCACCTGT TCTGGTTCTT CGGGCACCCG
GAGGTCTACA TCATCGCGCT GCCGTTCTTC GGCATCGTGT CGGAGATCCT GCCGGTCTTC
TCCCGCAAGC CGATCTTCGG CTACAAGGGC CTGGTCTACG CGACGATCGC GATCGCAGCC
CTGTCCGTCA CCGTCTGGGC GCACCACATG TACGCGACCG GCGCGGTCCT GCTGCCCTTC
TTCGCCTTCA TGACGATGCT CATCGCCGTG CCGACCGGTG TGAAGTTCTT CAACTGGATC
GGCACGATGT GGCGCGGGAA GCTGACGTTC GAGACGCCCA TGCTGTGGAG CATCGGGTTC
CTCGTGACGT TCCTCTTCGG CGGCCTGACG GGCGTCATCC TGTCGAGTCC GGCACTCGAC
TTCCACCTGT CCGACACGTA CTTCGTCGTC GCGCACTTCC ACTACGTCGT CTTCGGGACC
GTGGTGTTCG CGATGTTCGC CGGCTTCTAC TTCTGGTGGC CGAAGTTCAC CGGGCGCATG
CTCGACGAGC GACTCGGCAA GCTGCACTTC TGGCTCCTGT TCGTGGGCTT CCACATGACG
TTCCTCGTCC AGCACTGGCT GGGTGTCATC GGCATGCCGC GCCGGTACGC CGACTACTCG
CCGGCGGACG GGTTCACGTG GATGAACCAG CTCTCGACGG TCGGGTCGAT GATCCTCGCG
GCGTCGACGC TGCCGTTCCT CTGGAACGTC TACGTCACCT GGCGCAACGC CCCGAAGGTG
ACGGTCGACG ACCCGTGGGG CTACGGCGCC TCCCTGGAGT GGGCCACGAG CTGCCCGCCG
CCGCGGCACA ACTTCGTCTC GCTGCCCCGG ATCCGTTCCG AGCGTCCGGC GTTCGACCTG
CACCACCCGG AGGTCGCCGC GATGGACCAC GTGGCACCGG AGGACCCCGG GCCGCTGGAC
TGGGCGCCGC AGCAGACCGG TGAGCGGGAG CTGGCCGAGG AGCGGATCGC CCGTGGTTCC
GGCGAGCAGG ACCAGTCGTC GGCGACGGTC GTCGAGTCGG ACATCGAGGA CGTGCGCGAG
CGGCGCGAGG AGGAGGAGCG GTGA
 
Protein sequence
MAAHTEVIPG LSPRRQTLGR TVVKWLTSTD HKTIGYMYLI TSFVWFAIGG ILALLIRAEL 
FQPGMDLFQS KEQYNQAFTM HGTIMLLLFA TPLFAGFANI IMPLQIGAPD VAFPRLNMFA
YWLYLFGGLI AAAGFLTPQG AASFGWFAYA PLSNQLYSPG LGGDLWVFGL ALGGFGTILG
AVNFITTVVT MRAPGMTMFR MPIFTWNILV TSLLVLMAFP PLAAALFALG ADRRLGAQVF
NPDNGGALLW QHLFWFFGHP EVYIIALPFF GIVSEILPVF SRKPIFGYKG LVYATIAIAA
LSVTVWAHHM YATGAVLLPF FAFMTMLIAV PTGVKFFNWI GTMWRGKLTF ETPMLWSIGF
LVTFLFGGLT GVILSSPALD FHLSDTYFVV AHFHYVVFGT VVFAMFAGFY FWWPKFTGRM
LDERLGKLHF WLLFVGFHMT FLVQHWLGVI GMPRRYADYS PADGFTWMNQ LSTVGSMILA
ASTLPFLWNV YVTWRNAPKV TVDDPWGYGA SLEWATSCPP PRHNFVSLPR IRSERPAFDL
HHPEVAAMDH VAPEDPGPLD WAPQQTGERE LAEERIARGS GEQDQSSATV VESDIEDVRE
RREEEER