Gene Acid345_0452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0452 
Symbol 
ID4071699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp540175 
End bp541611 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content57% 
IMG OID637982456 
Productcytochrome c family protein 
Protein accessionYP_589531 
Protein GI94967483 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.395423 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.353435 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCTGAAC CGAATCAGCG ACCAACGTGG TTTTTGATTT CGCAGCACTG GCTGAGCGTG 
ACCGGGGTCG TGCTCGTTAT CACCGCAGTG CTCACGTGGA TATTCATCCT GCCAGTTCAA
TTGCGCGGGC ATGTGGATAA TCCGTATGCG GGACTGGTTG CGTTCGTTCT TCTTCCCGTG
GTGTTTTTCG GGGGGCTCGT CCTCACCCCA ATCGGGCTCT TTCTGGCAAA ACGCCGGATC
CGTCACGGTT TTTCGTCAGG TGGGTTCGAT CGGAAGAACG CACTGCGCCG CATCGCCATC
GTCGTGGGCG TGACGACAGT CCTGAACATC CTAATAGGCA CACAGGTTTC CTATCGCGCC
GTGAGCCACA TGGAGACACC TCAATTCTGC GGGGGGACGT GCCACGTAAT GGCGCCGGAG
TACGCGGCAT ACCAGAACTC GCCCCACTCC AGGGTGGAAT GTGTCGGGTG TCACGTTGCT
CCCGGCGCTT CGGGCTGGGT CAGCAGTAAA GCGGCGGGCA CGCGCCAGCT CGTAGAGACA
ATTCTAAAAT CCAGCCCCAA GCCGATTCCT TCCGCAATCG AAACCAACCG CCTCGTGCCT
GCGCGGGAGA CATGCGAACA CTGCCATTGG CCGGAGAAAT TCTCAGGCGT GAATCTACGA
GTTCTGACGA AATATGCGCC GGACGAAACC AATACAAGGA CGCAGACCGT CCTCCTAATG
ATGGTGGGGG GGGACAAATA TAAGGGCATT CACGGCGCAC ATGTCGGCCC CGGAATTCAC
ATCCGGTTTG CTGCATCCGA TCCTAAGAGA CAGACGATTA CACGGGTACA GTATGAGAAT
GAGTCTTCCG GCCTAAAAGA AGAGTTCGTC GCATCCGACA GCCAGAAGGC GGCGCCGGAT
GGCACGGCGA CGATCGAGAT GCAGTGCGTG GATTGCCACA ACCGTCCGAC TCACACGTTC
GAAATGCCCG AGCCTGGACT GGACAAAGCA CTCGCGCTCG GAGAGATTGC CGTGACCCTG
CCTTATGTCA AGAAGGAGAG CGCGCAATTG CTGCAGGCGA CTTACACGAG CCAGGCAGAG
GCGTCGGAGA AGATTCCTTC CCAATTAAAC GCCTACTATC AGCAAAACTA TCCCAGCGTT
TACAGCCAGC GTGGGGCAGA AGTCGATCGT GCCGGGAAAG CGGTCCTCGC GATTTACAAC
CGCAACGTTT TTCCGGAGCT TGGAGTTACA TGGGGAACCT ATCCGAACAA TCTCGGCCAC
ACTGAGTCCC CCGGCTGCTT CCGCTGTCAC GATGGCTCGC ACACTTCAAG TTCAGGCAAA
ACCATTCCGC AGGATTGCAA CAGCTGTCAC GAACCCCTGG CGATGGACGA GGCATCTCCG
GAAATTCTTC AGAAGCTTGG CATCGCCGAG CGCATTTCCG CTCTTCAGCG AAAATGA
 
Protein sequence
MPEPNQRPTW FLISQHWLSV TGVVLVITAV LTWIFILPVQ LRGHVDNPYA GLVAFVLLPV 
VFFGGLVLTP IGLFLAKRRI RHGFSSGGFD RKNALRRIAI VVGVTTVLNI LIGTQVSYRA
VSHMETPQFC GGTCHVMAPE YAAYQNSPHS RVECVGCHVA PGASGWVSSK AAGTRQLVET
ILKSSPKPIP SAIETNRLVP ARETCEHCHW PEKFSGVNLR VLTKYAPDET NTRTQTVLLM
MVGGDKYKGI HGAHVGPGIH IRFAASDPKR QTITRVQYEN ESSGLKEEFV ASDSQKAAPD
GTATIEMQCV DCHNRPTHTF EMPEPGLDKA LALGEIAVTL PYVKKESAQL LQATYTSQAE
ASEKIPSQLN AYYQQNYPSV YSQRGAEVDR AGKAVLAIYN RNVFPELGVT WGTYPNNLGH
TESPGCFRCH DGSHTSSSGK TIPQDCNSCH EPLAMDEASP EILQKLGIAE RISALQRK