Gene Acid345_0472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0472 
Symbol 
ID4069467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp585199 
End bp587193 
Gene Length1995 bp 
Protein Length664 aa 
Translation table11 
GC content59% 
IMG OID637982476 
Productcytochrome c family protein 
Protein accessionYP_589551 
Protein GI94967503 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.482092 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACCCG AGAATCCAAA CGGTCTGCGA ACGAATGCGA GTATTCATCA CCCCGAGCCA 
GGAGCGAGCG AAGGCGGTCC TCGCCTGCCA CTCCCGCTGG TGTGGTTCTT TGGGTTATTG
ACCCTGCTGC TGCTCCTGCC GCTCGCGGCC AAAGCGCAGA TCTCGCCTGG ACCGCTTTCT
CAAGCGCATC AGGATTTGAG CAGTTCGGCC TCCTGCACGA AGTGCCACTC GGTCTCGCCG
TCGTCGCCGA ACTTTCGGTG CCTGGATTGC CACCGGGACA TCGCAGCCCG CTTGCAACAG
AAACGCGGAT ATCATCCGGC GCTGGTGGGC TCGCAGCCCG GCTCGTCTTC GTGCGTGAAA
TGTCACAGCG AACACAACGG CGCGAACTTC GCTCTCGTGA AGTGGGACTC GAAGCACTTT
GACCATGCAA AGGCTGGCTT CGAACTCGAC GGCAAGCACG CTGCGCTGGA TTGCGCGCAG
TGCCATTCAG CGAAGAACAT CACACCCAGC GAACGTTTGA CGCTGAGCGC TCGAAACGCA
AACGACACCT ACCTGGGACT TTCGACCGCG TGCACCACCT GTCACGAAGA CAAACACAAT
GGGCGCCAGG GAGCTAATTG CCAGCAATGC CATGATGAGC GTAGTTGGAG CGCCGCTTCG
AGGATCGACC ACGCGAAGAC TCGCTACCCT CTGACCGGAG CGCACGCCCA GGTGAAATGC
CAATCCTGCC ACGTGCCCCA AGCGGATGGC AAAGTGAAGT ACGTCGGACT TCGCTTCGAC
GGCTGCGAGT CGTGTCACAA GGATGTACAC CAGGGCGAAT TCAGCAATCG CGCATGCCAA
AGCTGCCACA GCACTGGTGG GTGGAAGCAG ACCTCTTTCG CGCGCGAGTT CGATCACTCA
AAAACCAAGT TCATGCTCGC CGGCAAGCAC GCGGAGGTTG CGTGCAATGC ATGTCACAGA
GCTGGAGATT TCAAGGCTCC GATCGCTCAC GACCTTTGCG CCGATTGTCA CAAGCTCGAT
CCACACAATG GACAATTTGC AAAGCGGGCC GACGGTGGAA AGTGCGAAAG CTGTCACACG
GTAGAGGGCT GGAAAACTTC GAGATTTCTG GCCGCTGACC ATGCGAATAC CGGATTTCCT
CTTCGCGGCA AACATAGCAG CGTGGACTGT GCGAAGTGCC ACGTTCCGGC GGGCAAGGCG
ACGCTCTTCA AAGTGAAGTT CGCGCTTTGC ACCGACTGTC ACAAGGACGC ACACCAGGCT
CAGTTCGCCG GCGCGCCTTA CCTTAACAAG TGCGAGAAGT GCCACACCGA AAAAAGCTTC
CACGCGCCGA CATTCACCCT CGCGGAACAC CAGAAGAGCG GTTTCGTTCT GACGGGTGGT
CACCTGGCGG TAGCTTGCAT CGAGTGCCAC AAGGCGGCTG GCGATTCCCA ATCCGTGGCC
TATCACTTCA ATCGCCTCAC CTGTGCCACA TGCCACTCAG ATCCACATCG CGGCCAGTTC
CGCGCGCGGA TGGAACGTAT TACTGAGCAT GGCGAAGCCG CCGGTTGCGA GGCCTGCCAC
ACGACGAAGC GATGGAACGA CCTGCAACGG TTCGATCACT CCTCCACCAA GTTTGATCTC
TCCGGCGCTC ACAAGGCGGT TGAGTGCATC GGCTGCCACC GCCCGCCGGC CATGGAGCGC
AAGCTCATGA ACGTGGATTT CCAGGCCGCT CCCACATCTT GCGAGCAGTG TCATAACGAC
CCACATGGGT CGCAATTCGC CCACGCCGAT CGTGTGACGC GCTGCGCGGA ATGCCACGAC
GCGAACCGCT GGAGGCCTTC GCACTTCGAT CACGAGAAGA CCCTCTTTTC GCTGAAGGGC
GCACACCAGA ACACGCCCTG CAAGGGGTGC CACACGCAAT TCCGTGAAGT AGCGAACAAA
CAAGTGCTTT TCTACAAGCC CACGCCTACG AAGTGCGCGA GTTGCCATGC CAGCTCCGCA
ATACGCGCCA GCTAA
 
Protein sequence
MAPENPNGLR TNASIHHPEP GASEGGPRLP LPLVWFFGLL TLLLLLPLAA KAQISPGPLS 
QAHQDLSSSA SCTKCHSVSP SSPNFRCLDC HRDIAARLQQ KRGYHPALVG SQPGSSSCVK
CHSEHNGANF ALVKWDSKHF DHAKAGFELD GKHAALDCAQ CHSAKNITPS ERLTLSARNA
NDTYLGLSTA CTTCHEDKHN GRQGANCQQC HDERSWSAAS RIDHAKTRYP LTGAHAQVKC
QSCHVPQADG KVKYVGLRFD GCESCHKDVH QGEFSNRACQ SCHSTGGWKQ TSFAREFDHS
KTKFMLAGKH AEVACNACHR AGDFKAPIAH DLCADCHKLD PHNGQFAKRA DGGKCESCHT
VEGWKTSRFL AADHANTGFP LRGKHSSVDC AKCHVPAGKA TLFKVKFALC TDCHKDAHQA
QFAGAPYLNK CEKCHTEKSF HAPTFTLAEH QKSGFVLTGG HLAVACIECH KAAGDSQSVA
YHFNRLTCAT CHSDPHRGQF RARMERITEH GEAAGCEACH TTKRWNDLQR FDHSSTKFDL
SGAHKAVECI GCHRPPAMER KLMNVDFQAA PTSCEQCHND PHGSQFAHAD RVTRCAECHD
ANRWRPSHFD HEKTLFSLKG AHQNTPCKGC HTQFREVANK QVLFYKPTPT KCASCHASSA
IRAS