Gene Acid345_3312 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3312 
Symbol 
ID4070274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3924335 
End bp3926083 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content58% 
IMG OID637985334 
Producthypothetical protein 
Protein accessionYP_592387 
Protein GI94970339 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.248493 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.375683 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATATTGA GCAAGACATT CAAACTGGTC CTGTTCGGCT TGCTGTTCGC GATTTCGACA 
ATCGGTAGCG TTGCGCAGTC GAGCTGGGAC GGTTCCAGCT ATGTGCCGCT CGATAGTTGG
GTCTATCCGG CGGTAGAGCG GTTGGAGTCG CTGGGGCTGC TGCGTACTCC GTTCCTCGGC
ATGCGTCCCT GGACGCGGCT GCAGTGCGCG CAGTTCGTGA CGGAAGCCCG CCTCGGGATG
ACTGATGCCG ACCTGGCTGG AAGCGTCCGC TTCGAGATTT ACCGCGAGCT GGAGCACGAG
TTCAAGCCGG AGCTTGGGGA AATTGCCGGC GCCGGCAATG ACGTGATCGC GTTCGATGAC
GTTTATGCCC GGAGCATGCA GATCTCAGGC AAACCCTTGG CGGATGGCGA GCACTTCGGG
CAGACGATTA CGAACGACTT CGGGCGTCCC TACTGGACGG GGCAAAACTT CATAGCCGGG
GTGGAAGTGC ACGGCCAGGA CGGGCCGATC TTCGCCGCGT TCCGCGGGGA GTACCAGTAT
TCACCGGCGA TGCCGAGCCT GAGTGCCGGT GTGATTCAAA CGCTGTCAGA GATTGATCGT
GTCCCGCTTG CGGTGAATTC GTTCGGGCGG CCGCAAACGA ACACGTTCCG GCTTCTGGAT
ACCTATGTGG GGATCCGGCT GGCAGCGATG CAGGTCACGT TCGGCAAGCA GAGTTTGAAC
TGGGGGCCGA CGCAGATGGG CAGCATGTTG TTCAGCAACA ACGTCGATCC GCCTTACATG
CTGCGCATCA GCCAGGTCAC CCCGGCGCGA TGGCCCTGGA TTCTCAGATA CTTAGGTCCG
GCAAGGTCGG AGTTCTTCTT CGCCAAGATG TCGGGCCATA TGTATCCGGC GCGTCCGTTC
ATTCACGGGG AGAAGTTCTC GTTCCGGCCG ACGGAGAACC TGGAGTTCGG GATTTCGAGG
ACGACGGTGT TTTTGGGGGT CGGCCACGGG ATGACGATCG GGCGGATTGC CAAGAGCTAT
TTCAGCGTGG GGGATAACTT AACCAGCAAC GCCTCCAGCT CGGATCCGGG CGATCGTAAG
GGCGGTCTGG ACTTCTGGTA CCGGCTGCCT AAACTCCGGA ATTGGTTAAG CTTCTATAAC
GATTCGTTCA CCGATGATGA TCCCTCGCCA CTTGCGGCTC CGGCCCGGGC GCCAATGAAT
CCAGGACTGT ATTTATCGCA TGTTCCAGGC ATTCCTAAGC TCGACTTTCG AGCCGAAGCT
GCATATACTG ACCTGACCGC AGCCCATAGC AAAGGCGGCA CCTTCGTCTA TTACAATGTT
GTCTATAAAG ATACCTATAC CCAAAAGGGT TTTCTCCTAG GCAACGTGGT GGGGCGGCAA
GGCAAAGCCT ATCAGGCCAG CACTTCCTAT TGGTTTACGC CGAGGCGGCG CTTGCAGTTG
CTGTACCGGG ACCAGCAGGT GCGCAACGAT TTCATTCCGG GCGCCGGAAC GCAGCATACG
GGTCAGGCTA CGTTCGATTG GAACTTTGGG AAGAAGGTGT CCATGTCTGC CATGGCCCAG
TATGAAAGGT GGACGATTCC GCTCATAGCG GATGGACCGC AGAACGATTT CACCGCATGG
CTGCAGCTGA AGTATTCGCC GGACAAAGTA TTTGAACTAC CCGCGCTGCG CCATCATCAA
CCGGCAGCGC CGCCAGCGAC TGTGCAATAC GGAGACCAGC AATGGGTCGA TCCACAACAA
CAGAAGTAG
 
Protein sequence
MILSKTFKLV LFGLLFAIST IGSVAQSSWD GSSYVPLDSW VYPAVERLES LGLLRTPFLG 
MRPWTRLQCA QFVTEARLGM TDADLAGSVR FEIYRELEHE FKPELGEIAG AGNDVIAFDD
VYARSMQISG KPLADGEHFG QTITNDFGRP YWTGQNFIAG VEVHGQDGPI FAAFRGEYQY
SPAMPSLSAG VIQTLSEIDR VPLAVNSFGR PQTNTFRLLD TYVGIRLAAM QVTFGKQSLN
WGPTQMGSML FSNNVDPPYM LRISQVTPAR WPWILRYLGP ARSEFFFAKM SGHMYPARPF
IHGEKFSFRP TENLEFGISR TTVFLGVGHG MTIGRIAKSY FSVGDNLTSN ASSSDPGDRK
GGLDFWYRLP KLRNWLSFYN DSFTDDDPSP LAAPARAPMN PGLYLSHVPG IPKLDFRAEA
AYTDLTAAHS KGGTFVYYNV VYKDTYTQKG FLLGNVVGRQ GKAYQASTSY WFTPRRRLQL
LYRDQQVRND FIPGAGTQHT GQATFDWNFG KKVSMSAMAQ YERWTIPLIA DGPQNDFTAW
LQLKYSPDKV FELPALRHHQ PAAPPATVQY GDQQWVDPQQ QK