Gene Acid345_0434 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0434 
Symbol 
ID4069660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp512966 
End bp514345 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content58% 
IMG OID637982438 
Producthypothetical protein 
Protein accessionYP_589513 
Protein GI94967465 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.342382 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0528043 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGATAG ATGTGATCGG GCGTTTTCAG CGCCTGCGCT CGGCAATCGC TTCCTACCTC 
GCCATGGGGA TGCTGCTCGG CATCTTCCTT GCGGCAAGCA GCCAATCCAG TTTTGCGATC
GCGGCTTTCG CACGCAAGTA CGGATTGCCG TGTTCCGCGT GTCACGATGC ATGGCCCAAA
CTCAATAATT TCGGCCAGAC GTTCAAGGAC AACGGATACC AGTTGATGAA CGACCGCGAC
GCTCCGATCT GGCAGAATCC GAGCTACTGG CCGGTGGCGT TCCGTATCAC ACCGAATTGG
GACCTCGAGA ACACTGGCAG GGTGGCAACC GACCAGGCGC CCAACGGACA GTCGGTCACA
ACTCACGGTT TCAATCTAAG CGGCCTCGAC ATTCTCACCG CGGGCACGCT CAACAAAGAC
ATCTCCTTCC TGCTAGTCCC TTCTGCCGAT GAAACGGGCG CCTTCCATTT CGAATCAGCT
TGGGTACGGT TCGACAACCT GTTCCACAGC TCGTGGGCGA ATTTGAAGGT CGGCAAGTTC
GAGCTCGACA ACCTGATCTC GGAAAAACGA GGTCTCACCC TCTCAGCGAA CGGCGGCTCT
TACCAGCTTT ACCACTTCCT GCCGTTTGGC GACTCGAACC CCTACCAGTT CGGTATCGGC
GACAACCAGA TGGGAATCGA GTTCGCCGGC CACTCGAAGA ACGACTACAC GCGTTTCTCG
GCCTCGCTGC TCAGCAGTAA TGACGGCAAC GTGGACGTTC CTTACGGCAG CACGTATGAC
GGGAGCTTCA CCTTCAGCAC CGCCTTCAAC GCGGGCAGTT TAGGTCTACA GCGAATTGGG
GCGCAGGCTT ACATCGGGCA GGCCCCGACG TACTACCTCA CATCCGGTGG CGGCGATATC
CCAGGCACCG GCAAGGGCAA TCACGGCTTC AGCCGTGAAG CGGTTTTTGC CCTGCTCTAC
TTCGGCAAAT TCGACGTTAC GCCGCTGTTC GGGCACGGTA GCGAGAGCGC ATACCTGGCG
AATTACGTTT CGACCGGCGG CATGCCCCCG GTATTGCCAG CGGGCTCGCG CGATCCATCG
TGGAACAACT TCATGGTGGA ATCTCACTAC CTGTTCAACC CGCAGTTCAT CATGACGTAT
CGCTACGACG CCATCTACAT GACGCAGCAG GCGAGCACCG CGTATCCGGA TGATGCCGGC
AACACCACGG CGAATACGAT TGCTGCCCGC TATTACCCGT TCATGCATAG CCGCGCGGGT
TTCGCGCTGC ATGGCGAGTT CTCGCACCTG AACCAGAAAC ACGTTTTCTC AACCACAACC
GGAACGTTGC AGGACGTTAG CTACTACAGC GTCTTTGGCG GCATGGACTT CATTTTCTAA
 
Protein sequence
MRIDVIGRFQ RLRSAIASYL AMGMLLGIFL AASSQSSFAI AAFARKYGLP CSACHDAWPK 
LNNFGQTFKD NGYQLMNDRD APIWQNPSYW PVAFRITPNW DLENTGRVAT DQAPNGQSVT
THGFNLSGLD ILTAGTLNKD ISFLLVPSAD ETGAFHFESA WVRFDNLFHS SWANLKVGKF
ELDNLISEKR GLTLSANGGS YQLYHFLPFG DSNPYQFGIG DNQMGIEFAG HSKNDYTRFS
ASLLSSNDGN VDVPYGSTYD GSFTFSTAFN AGSLGLQRIG AQAYIGQAPT YYLTSGGGDI
PGTGKGNHGF SREAVFALLY FGKFDVTPLF GHGSESAYLA NYVSTGGMPP VLPAGSRDPS
WNNFMVESHY LFNPQFIMTY RYDAIYMTQQ ASTAYPDDAG NTTANTIAAR YYPFMHSRAG
FALHGEFSHL NQKHVFSTTT GTLQDVSYYS VFGGMDFIF