Gene Arth_2216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2216 
Symbol 
ID4445277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2491711 
End bp2493438 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content61% 
IMG OID639690025 
Productcytochrome-c oxidase 
Protein accessionYP_831696 
Protein GI116670763 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID[TIGR02891] cytochrome c oxidase, subunit I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00587232 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCTACGT ACACTCAATC CGCACCTGCC GGGGCCCTTG GGGCGCCCGT TGTTCCGAAG 
TCCAAGGGAC GCATCGTCGT CAACTGGATC ACTTCGACCG ACCACAAGAC CATCGGGTAC
ATGTACCTGA TCTCGTCCTT CGTGTTCTTC TGCTTCGGCG GCGTCATGGC GCTGCTGATC
CGCGCCGAAC TTTTCGAGCC CGGAATGCAG ATCCTGCAGA CCAAAGAGCA GTACAACCAG
CTGTTCACCA TGCACGGAAC CGTCATGCTG CTGATGTTTG CGACCCCGCT GTTCGCCGGC
TTCGCCAACG TCATCATGCC CCTGCAGATC GGTGCACCCG ACGTCGCCTT CCCGCGACTG
AACGCACTGG CTTTCTGGTT CTTCCTCTTC GGCTCCACGA TCGCCGTCTC CGGCTTCATT
ACGCCCCAGG GTGCCGCTTC GTTTGGCTGG TTCGCGTACG CGCCGCTGTC CAACACCACA
TTCAGCCCCG GCGTCGGCGG TGACCTCTGG GTGTTCGGCC TCGCACTCTC CGGCTTCGGC
ACCATCCTCG GTGCAGTCAA CTTCATCACC ACCATCATCT GCATGCGCGC TCCGGGCATG
ACCATGTGGC GCATGCCGAT CTTTACCTGG AACACGCTGG TTACGGCCAT CCTGGTCCTC
ATGGCCTTCC CGCCTCTCGC TGCAGCCCTG TTCGCCCTCG GCGCGGACCG CCGCTTCGGA
GCACACATCT TCGATCCCGA GAACGGCGGT GCAGTCCTCT GGCAGCACCT GTTCTGGTTC
TTTGGCCACC CCGAGGTGTA CATCATCGCG CTGCCGTTCT TCGGCATCGT CTCCGAGATC
TTCCCGGTCT TCAGCCGCAA GCCGATCTTC GGCTACAAGG GCCTCGTGTA CGCAACCATC
GCCATCGCTG CTCTGTCCGT GACCGTGTGG GCTCACCACA TGTACGTCAC CGGCTCGGTC
CTCCTGCCGT TCTTCTCCTT CATGACGATG CTGATCGCCG TACCTACCGG CGTGAAGTTC
TTCAACTGGA TCGGCACCAT GTGGCGGGGT TCCATCACCT TCGAAACGCC CATGCTCTGG
AGCATCGGCT TCCTGGCAAC CTTCCTGTTC GGTGGTTTGA CGGGCATCAT CCTGGCTTCA
CCGCCCCTTG ACTTCCACGT ATCGGATTCC TACTTCGTGG TGGCCCACTT CCACTACGTG
GTGTTTGGCA CCGTGGTATT CGCAATGTTC GCCGGCTTCT ACTTCTGGTG GCCGAAGTGG
ACCGGCAAGA TGCTCAACGA GCGCCTGGGC AAGATCCACT TCTGGCTCCT GTTCCTTGGT
TTCCACGGAA CCTTCCTGAT TCAGCACTGG CTGGGTGTCG AGGGCATGCC CCGCCGCTAC
GCGGACTACA TGCCGCAGGA CAACTTCACG TGGATGAACC AGTTCTCCAC AATCTCCTCG
TTCGTGCTGG GCGCTTCGCT GATCCCGTTC TTCTGGAACG TGTACATCAC CTGGCGCAGC
AACGAAAAGG TTGAAGTGGA CGATCCCTGG GGCTTCGGTG CTTCTCTCGA GTGGGCAACC
TCTTGCCCGC CGCCGCGCCA CAACTTCACG TCGCTGCCCC GGATCCGCTC GGAGCGTCCT
GCCCTGGACC TCCACCACCC GGAGCTCGCA CAGTCGCACA CCGTTGAATC ACCGGCACCG
GCAGCGTCCG TGCTGGGCAA CGCAGATCAG AAGGACACCG CCAAGTGA
 
Protein sequence
MATYTQSAPA GALGAPVVPK SKGRIVVNWI TSTDHKTIGY MYLISSFVFF CFGGVMALLI 
RAELFEPGMQ ILQTKEQYNQ LFTMHGTVML LMFATPLFAG FANVIMPLQI GAPDVAFPRL
NALAFWFFLF GSTIAVSGFI TPQGAASFGW FAYAPLSNTT FSPGVGGDLW VFGLALSGFG
TILGAVNFIT TIICMRAPGM TMWRMPIFTW NTLVTAILVL MAFPPLAAAL FALGADRRFG
AHIFDPENGG AVLWQHLFWF FGHPEVYIIA LPFFGIVSEI FPVFSRKPIF GYKGLVYATI
AIAALSVTVW AHHMYVTGSV LLPFFSFMTM LIAVPTGVKF FNWIGTMWRG SITFETPMLW
SIGFLATFLF GGLTGIILAS PPLDFHVSDS YFVVAHFHYV VFGTVVFAMF AGFYFWWPKW
TGKMLNERLG KIHFWLLFLG FHGTFLIQHW LGVEGMPRRY ADYMPQDNFT WMNQFSTISS
FVLGASLIPF FWNVYITWRS NEKVEVDDPW GFGASLEWAT SCPPPRHNFT SLPRIRSERP
ALDLHHPELA QSHTVESPAP AASVLGNADQ KDTAK