Gene Acid345_1379 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1379 
Symbol 
ID4068914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1670154 
End bp1671497 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content58% 
IMG OID637983388 
Producthypothetical protein 
Protein accessionYP_590455 
Protein GI94968407 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2837] Predicted iron-dependent peroxidase 
TIGRFAM ID[TIGR01413] Dyp-type peroxidase family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.803434 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGTGCAG ATCCTGCTCG TCTCGATTTC GAAGACATCC AGCATTTCCT TATTGCGCGG 
CCACCGGCGC TGGCAGCTCG CTACGAATTC CTGACATTTC GGAAAGGCGC GGCGGGACGT
GCGTGGCTGG CGGGATTGCT GGATAAGGTA GGAACTGCTG CAGCGGTGGC GGCCCACACC
ACCCTTGATG CGCGATGGGT GAGTATCGGG TTTACGTGGA ACGGATTGCG CGCACTTGGA
TTATCGGAAG ATTCTTTAGG CACTTTTCCG GAAGAGTTTC GCCAGGGAAT GGCAGGACGG
GCGCAAGTTC TCGGTACAAC CGGTGCGAAC CACCCAGACC ACTGGGTTGG AGGACTGGCA
AGCGGAGAAT TACATGCCAT CATCGTGCTG TTTGCGCGTG ACGTTGCGGA GCGAGAGCGC
TGTCGGGCGG AGCACGCGCG ATATCTTGCG CAATGCGACG GAGTGGAACT TCTATCGTCA
TTGGATCTGG AAGCGATTCC GCCGTTTGAT CATGCTCACG AACACTTTGG CTATCGTGAT
CGACTCTCGC AACCAGTGAT TGAGGGAACA GGAGTGGTGC CTACTCCTGG GTCGGGCCAG
CCTCTGAAAG CAGGCGAGTT CTTCCTGGGT TATCCCGATG AAGACGGTCC CGCCGTCGGA
ATGCCGCAGC CTGAAGTTCT TTCTCGGAAT GGTAGTTATG CCGCCTATCT CAGGATGGAA
GAGCATGTGG GTGCATTTCG CGATTTCCTC AAGCAGCATG GTGAGACACC CGAACAACAG
GAACTGGTAG CCGCCAAGCT CATGGGACGG TGGCGAAGCG GCGCTCCGCT GGTCGTGACA
CCGGACAAAG ACGATCCCGT CCTTGGCGCT GATTTGCAGC GGAGTAATGA TTTTGCCTAT
GCGACCCAGG ATCCCCACGG CTATGGCTGT CCGCTTGGTT CTCACATTCG CCGCATGAAC
CCGCGTGATA CCGCGGTGAA TATGAACCGT CGGAAGATGA TCCGGCGCGG GGGAACGTAT
GGTCCTCCCC TGCCAGAGGG TGCAGCGGAC GACGGAGTTG AGCGAGGGAT CGCGGCGTTC
GTTGGTTGCG CGAGCCTGGT CCGCCAATTT GAATTTGCGA TGAACGTGTG GGCGAACGAT
CCAACCTTCC ACGAGCTAGG CAATGAGCGC GATCCGTTTT TAGGCACTCA GGACGGGACG
TTTGACATGA CGATACCCAA GCGCCCGATC CGAAAAAAGA TCAAAGGTCT GCCGGCCTTC
ACGACGATAC GCGGAGGGGC CTACTTCTTC CTGCCTGGGA TCAAGGCACT CCGATATCTG
ACATCCCTGA ACGATAGCCA ATGA
 
Protein sequence
MSADPARLDF EDIQHFLIAR PPALAARYEF LTFRKGAAGR AWLAGLLDKV GTAAAVAAHT 
TLDARWVSIG FTWNGLRALG LSEDSLGTFP EEFRQGMAGR AQVLGTTGAN HPDHWVGGLA
SGELHAIIVL FARDVAERER CRAEHARYLA QCDGVELLSS LDLEAIPPFD HAHEHFGYRD
RLSQPVIEGT GVVPTPGSGQ PLKAGEFFLG YPDEDGPAVG MPQPEVLSRN GSYAAYLRME
EHVGAFRDFL KQHGETPEQQ ELVAAKLMGR WRSGAPLVVT PDKDDPVLGA DLQRSNDFAY
ATQDPHGYGC PLGSHIRRMN PRDTAVNMNR RKMIRRGGTY GPPLPEGAAD DGVERGIAAF
VGCASLVRQF EFAMNVWAND PTFHELGNER DPFLGTQDGT FDMTIPKRPI RKKIKGLPAF
TTIRGGAYFF LPGIKALRYL TSLNDSQ