Gene Acid345_3447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3447 
Symbol 
ID4070331 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4066231 
End bp4067955 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content58% 
IMG OID637985469 
ProductZn-dependent hydrolase 
Protein accessionYP_592522 
Protein GI94970474 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.453465 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTCGC ACAAGAAGCG GTCGCTAACC TGGATCCTCA TCGCTCTCTT ATCGGTCTCG 
AGTGCTGTCT GTCAGACCAC GAAGAAACCC GTCCACAAGA AATATCCGAT CGGCGGATCG
GCGCCGAACG GCGCACGGCT CAAAGCCGCT GATCTCGATG CGCGCCTGGC GAAATGGCGG
CGCACGCCAA TGCCGTTCGA CAGTGAAAAG CTGGCGGCGC GCGATGTGTG GATGATCCAG
AAGCTGGTGA CTGCATGCCA GTATCTGGAT GCGATCTATT GGCGGCAGTC CGATCCCGAT
GGGCTGACGC TCTACAAGCA GCTTGAGTCG AGCAAGATTG CACGCGATCA GAAGATCGTG
CGCATGCTGC AAATCAACGG CAGCCGCTGG GACCTCCTCG ACAACAGCCA GCCGTTCGTA
GGCGATGAGA AGATGCCGGC TGGCCATGCG CTCTATCCGG CGGGCATCAC ACGCGACGAA
ATTGAGAAGT ACGTCAAAGA TCATCCGGAA GAGAAAGACG CGATCTACAA CGAGCGCACG
GTACTTCGAC GCAATGGCAG TGAGTTGCAG GCGATTCCGT ATCACGTGGC GTATCGCGCG
TTTCTGGAGC CGGCAGCGCG GGCGTTGAAG GAAGCCTCGG CGCTGGCGCG CGACAAGGCC
TTCGCAAACT TCCTTCGCAT GCGCGCCGAT GCGCTGCTGA ACGACGATTA CTATCCGAGC
GATGTGGCGT GGCTGGAGCT GCAGAACCCA CGTTTCGACA TCATCATGGC GCCGTACGAG
ACCTATCTTG ACGACCTGCT TGGCGTGAGG ACGAGTTACG GCGCAGCGGT GTTGATCCGC
AATGATCACG AGAGCCGGCG GCTTGATCTC TATGAAAAGT ACGTTCCGGA GATGCAGCAG
GCACTGCCGC TGACTGCGGA GGACAAGCCT TCCAAAGAAG GCCAGCGCAT GCCGATGGAA
GTGATGGATG CGCCGTTCCG CACCGGCGAT CTTGGCCATG GCTACCAGGC CGTCGCCGAC
AATCTGCCGA ATGATCCGAA GATCCACGCC GAGAAGGGCA CCAAGAAGAT CTTCTTTAAA
AACTTTATGG ATGCCCGCGT CAACTACGTC GTGATTCCGA TGGCGCAGCT TGTGATGGAC
TCGGTGCAAG CGACGAAGGT CACGGCAGAA GGCTACCTGG CGACGACCCT GATGCACGAG
ATTGCGCACG GTTTGGGGCC GGCCTTCGCG CGCGGGCCCA ACAAACTGGT GGATATCCGG
GAGGCGATTG GCGCCAGCTA CAGCGGCCTG GAAGAAGCCA AGGCCGACAC TGCCGGCATG
ATTTGCCTGC AATGGATGAT CGACCACGGA TATATCCCCA GGACGAAGTC CGACGAGTAC
TACATTACGT ATGTCGCCGA CCTCTTCCGG GCGATGCGTT TTGGCGCTGG AGAAGCCCAT
TCCGCCGCCG AGACGATGGA ATTCAACTAC CTCGCTGAGC AGGGCGCCAT CAAGCGCGAC
GCAAATGGCC GATATTCGGT GGACACGGGC AAAATTCCGG CCGCCGTAGC TGCATTAGCC
AAAGAGTTAC TAGAGATCGA GGCCACTGGG GACCGCGATC GCTGTGAAAA ATGGTTCTCC
CATTACGGAA GTTTTCCGCC GGAGTTGACC AAATCTCTGG ACGCGGCGAA GAATGTTCCA
GTAGATATAG ACCCGGTCTT TTCCTTCCCC AGGAAGCTCC AGTAG
 
Protein sequence
MDSHKKRSLT WILIALLSVS SAVCQTTKKP VHKKYPIGGS APNGARLKAA DLDARLAKWR 
RTPMPFDSEK LAARDVWMIQ KLVTACQYLD AIYWRQSDPD GLTLYKQLES SKIARDQKIV
RMLQINGSRW DLLDNSQPFV GDEKMPAGHA LYPAGITRDE IEKYVKDHPE EKDAIYNERT
VLRRNGSELQ AIPYHVAYRA FLEPAARALK EASALARDKA FANFLRMRAD ALLNDDYYPS
DVAWLELQNP RFDIIMAPYE TYLDDLLGVR TSYGAAVLIR NDHESRRLDL YEKYVPEMQQ
ALPLTAEDKP SKEGQRMPME VMDAPFRTGD LGHGYQAVAD NLPNDPKIHA EKGTKKIFFK
NFMDARVNYV VIPMAQLVMD SVQATKVTAE GYLATTLMHE IAHGLGPAFA RGPNKLVDIR
EAIGASYSGL EEAKADTAGM ICLQWMIDHG YIPRTKSDEY YITYVADLFR AMRFGAGEAH
SAAETMEFNY LAEQGAIKRD ANGRYSVDTG KIPAAVAALA KELLEIEATG DRDRCEKWFS
HYGSFPPELT KSLDAAKNVP VDIDPVFSFP RKLQ