Gene Acid345_0427 details

Gene Information

Locus tag: Acid345_0427
Symbol:
ID: 4069653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 499063
End bp: 501198
Gene Length: 2136 bp
Protein Length: 711 aa
Translation table: 11
GC content: 56%
IMG OID: 637982431
Product: cellulase precursor
Protein accession: YP_589506
Protein GI: 94967458
COG category:
COG ID:
TIGRFAM ID:
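
The coordinate and length fields above can be checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive positions and that the gene length includes the stop codon while the protein length does not; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end positions."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(499063, 501198)
print(length, protein_length_aa(length))  # 2136 711
```

Under these assumptions the computed values agree with the listed Gene Length (2136 bp) and Protein Length (711 aa).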


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0137531
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATGTA TTCGGAAGCA GTGTCTCTTC GTCTCCCTGA TTGCGATTTG CTCAAGCGTC 
ATTGGGCTCG GGCAAACGAT TCGGGTAGAT ATCAGCCACC CCACCAATTC GTTCGTTCCA
AACGAATCTC TCGGTGCTGG CATTGATCGT ATCGGAGCGG CAGCGGCGGC GAAGTACTTC
AGCGAGCCTC CGTCGAAAGC CGTGCTGGAT GCCGGTTGGC AGCCTGTGAC CTATCGGCAG
AACACCGAGT TGGCAGTTGA GGCCTGGCAC TGGAATCCGC AGGGCACGTG GAGCGATCCG
AGCGGCAAAG GATACTTCAC CGGTAGCGCC GAACCGGGAC CGGAGATCAT TAAGCATTCG
TACGGATATG TTTTGCCCCA TCGTGGCTTT ACGCGTAACG ATGGAACCGA CACGAGCGGC
TTCGGTAGAA TGACCGACGG CGATCTCAAC ACCTATTGGA AGAGCAATCC GTATCTCACG
AAACATTTCA CTGGCCAGGA AGATTCCGAA CATCCGCAGT GGATCGTGAT TGATCTTGCA
ACGACACAAT CCATAAACGC TTTGCGGATC GCCTGGGCAG AGCCTTACGC CAAGCAATAC
CTAATTCAGT ATTGGACCGG CGACGACGCG ATCAAATTAC CGAGCAAAGG ATCGTGGGTC
GCGTTCCCCG CGGGCGTGGT AAGCGCCGGC AAGGGTGGGA CGGTCACCCA CCAACTCTCG
AATTCGCCGA TGCCGGTGCG TTTCTTGCGC GTCTGGATGA CGGAGCCATC GAATACCTGC
GACACACATG GCTCGGCCGA TATTCGCAAT TGCGTTGGTT ATGCAATCCG CGAGGTGTTC
ATCGGGACGA CCGACAGGAA TGGCTTCCAT GATGTGGCGC GGCACACCCC CGACCAGGAT
CAGACCACGA CTTACTGCTC TTCGATTGAC CCTTGGCACG AGCCTTCGGA CTTGGGATCG
ACGCAACACG AACACGTTGG CATGGATCTG TTTTATCGCA GCGGATACAC ACGCGGCTTG
CCGGCAATGA TTCCAACCGC GTTGCTCTAT GGCACGCCCG AAGATTCCGC TGCTGAAATC
GCGTACGTCG AGAAGCGCGG CTACCCGATC TCCTATGTGG AACTCGGCGA AGAGCCGGAT
GGCCAGTACA CGCTTCCGGA AGACGACGCT GAACTCTACC TGCAATGGGC GAGGGCAATC
CACAAGGTGG ATCCGAAGCT GAAACTTGGC GGGCCGGTAT TTACGGGCCA GAACGAAGAC
ATTTTGTCGT GGCCCGATGC GCAAGGTCAG ACATCGTGGA CGCGCCGGTT CCTGAACTAT
TTGAGGGCCC GCGGCGGTCT GCGAGAGCTC GCCTTCTTCT CGTTCGAACA CTATCCATTC
GAACCGTGCA AGGTGAATTG GAGCAGCCTT TACGACGAGC CGCAGCTCAT GACTCACATC
ATGCAGGTTT GGCGTGACGA CGGTCTGCCT GCCGATGTGC CAATGTTCGT TACCGAATCG
AACATCACGT GGAACAGTGG CGAGTCGTCG GTAGACATCT TCGGAGCGCT CTGGCTCGCG
GATTATGTTG GCTCGTTTTT CACCGCAGGC GGCAAGGGAC TTTACTACTT CCATTATTTA
CCGCTGGGTG TGCACCCGGG GTGTAATCAG TCCGGCGGTA CGTTTGGTAT GTTCACGACG
AAGGGCAATT TCGAAGTCGA CAAGCCGACG TCGCAGTTCT TCTCGAGCCA GTTAATCAAC
ACCGAATGGG TGCAGCCTGG TGACGGAGTG CACGAAACCT ACGCCGCGAC TGGCGATCTC
ATGGACGCCG CCGGGCACGC CTTAATCACC GCCTACGCCG TGAAGCGCCC TGATGGCCAA
TGGTCGCTGC TCGTTGTGAA TCGCGATCAG GAGAATGCGC ATAAGGTGAC GATCGATTTC
TCTGATTCTG GGCGGGGCAA GACTGGATTT GCAGGGCCAG TGCAGTTGCT GACCTTCGGC
AGCACGCAAT ATAAGTGGAA TCCCACAAGG GAAGGCGGCT TCCCCGATCC AGATGGGCCG
GTTGCAAAAT CCAGCATTAA CGCATCGGCC GACACGGTTT ATGAATTGCC GAAAGCATCC
ATGACTGTAA TTCGCGGGTC GCTTTCACAT CAGTAA
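
The GC content field in this record is presumably the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming whitespace in the printed sequence is only visual grouping; the function name and the rounding to a whole percent are assumptions, not something the record specifies.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring the whitespace used for grouping."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First printed line of the gene sequence, with its 10-base grouping preserved.
first_line = "ATGAAATGTA TTCGGAAGCA GTGTCTCTTC GTCTCCCTGA TTGCGATTTG CTCAAGCGTC"
print(f"{gc_content(first_line):.0%}")
```

Passing the full gene sequence instead of the first line gives a figure that can be compared with the listed GC content.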
 
Protein sequence
MKCIRKQCLF VSLIAICSSV IGLGQTIRVD ISHPTNSFVP NESLGAGIDR IGAAAAAKYF 
SEPPSKAVLD AGWQPVTYRQ NTELAVEAWH WNPQGTWSDP SGKGYFTGSA EPGPEIIKHS
YGYVLPHRGF TRNDGTDTSG FGRMTDGDLN TYWKSNPYLT KHFTGQEDSE HPQWIVIDLA
TTQSINALRI AWAEPYAKQY LIQYWTGDDA IKLPSKGSWV AFPAGVVSAG KGGTVTHQLS
NSPMPVRFLR VWMTEPSNTC DTHGSADIRN CVGYAIREVF IGTTDRNGFH DVARHTPDQD
QTTTYCSSID PWHEPSDLGS TQHEHVGMDL FYRSGYTRGL PAMIPTALLY GTPEDSAAEI
AYVEKRGYPI SYVELGEEPD GQYTLPEDDA ELYLQWARAI HKVDPKLKLG GPVFTGQNED
ILSWPDAQGQ TSWTRRFLNY LRARGGLREL AFFSFEHYPF EPCKVNWSSL YDEPQLMTHI
MQVWRDDGLP ADVPMFVTES NITWNSGESS VDIFGALWLA DYVGSFFTAG GKGLYYFHYL
PLGVHPGCNQ SGGTFGMFTT KGNFEVDKPT SQFFSSQLIN TEWVQPGDGV HETYAATGDL
MDAAGHALIT AYAVKRPDGQ WSLLVVNRDQ ENAHKVTIDF SDSGRGKTGF AGPVQLLTFG
STQYKWNPTR EGGFPDPDGP VAKSSINASA DTVYELPKAS MTVIRGSLSH Q
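
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below builds a codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codons enumerated in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence from this record.
print(translate("ATGAAATGTA TTCGGAAGCA GTGTCTCTTC"))  # MKCIRKQCLF
```

The result matches the first ten residues of the protein sequence listed above.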