Gene Acid345_3334 details


Gene Information

Locus tag: Acid345_3334
Symbol:
ID: 4070296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3952847
End bp: 3955639
Gene Length: 2793 bp
Protein Length: 930 aa
Translation table: 11
GC content: 63%
IMG OID: 637985356
Product: hypothetical protein
Protein accession: YP_592409
Protein GI: 94970361
COG category: [S] Function unknown
COG ID: [COG2120] Uncharacterized proteins, LmbE homologs
TIGRFAM ID:
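
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical and not part of the database.

```python
# Values copied from the gene record above.
START_BP = 3952847
END_BP = 3955639
GENE_LENGTH_BP = 2793
PROTEIN_LENGTH_AA = 930


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS whose final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 2793
print(coding_codons(GENE_LENGTH_BP))   # 930
```

Under these assumptions both results agree with the reported gene length (2793 bp) and protein length (930 aa).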


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.896794
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAACCT GTACAACTGC AAAGTTCATG AAATTGTCCC TGGATCGCAC ACGCCCGTTC 
GTCGCTGGTT TGCTGGCCCT CTGCCTGGCC TCCCCTTCGG CCCTACTCTC CCAGTCTGCT
GCTCCGCCCG CGCCGGAAGC CAAAGTTGTT GATGAGCTTG CGCCTTTGCC CCAGGACGAT
GGCCGTCTCG GCCTCGAACT GCTACTCAAG CGCCTGAAGA CCACCGCGCG CTTGATGCAC
ACCACCGCTC ATCCCGACGA CGAAGATGGC GGTATGCTCA CCCTCGAATC GCGCGGCAAG
GGTTACGACG TTACCCTGAT GACCCTCACC CACGGCGAGG GCGGCCAGAA CAAGACCGGC
AGCAATCTCT TCGACGAACT TGGCGTGCTC CGCGTCCTCG AACTGCTGGA GAGCGATAAA
TATTACGGCG TCCACCAGCG CTTCAGCATG GCCGCTGATT TTGGCTTCAG TAAGAGCCCG
CAGGAGTCGC TGGAGAAGTG GAAGGACGGC GGCAAGCCCG GCTACATCCC GCTGCGCGAC
ATGGTGCGCG TCATCCGAAC TTTCCGTCCT GACGTCATCG CCCAGCGCTT CCAGGGCTCC
GAGCGCGACG GCCACGGCCA TCACCAGGCG TCGGGCATCA TCACCAAGGA AGCGTTCCGC
GCCGCCGCCG ATCCCAAGCA ATTTCCGGAA CTCAACCTTC CGCCGTGGCA GGCTAAAAAG
CTCTACATGG ACAACGTCCG CGATGGCGAG GAGTACACGG TCGCCTTCGA TACCGGCGCG
GAAGACCCCG CCCTCGGCAT GAGCTACGTG CAGTTCGCGA TGAAGGGCCT GAAGCACCAG
CTCTCGCAAG GTGCCGGCGC ATGGAACGTC GATCCCGGCC CGCACATCAC GCGGTACAAG
CTGATTGACT CAGAGTTGCC GCAACCCAAA GCCGGCGAGC ACGAAAAAGA TTTCTTCGAC
GGCATCGACA CCTCACTGCC CGCGCTCGCC GATCGTCTCG GCGCGGATGA GAACAGAGTC
CCATGGCTGA AGCCCGGTTT GGAAGAAGTT GCGAAACTGA TTGACCAAGC CAGTGAAGCC
GAGAAGAAGG ACATGGAGAG CGCGGCGGCG CCGCTGAGCG CAGCCGCTCA AAGCCTGAGT
GCGCTGCGGA CAAGGATCGA AAATAGTCCG CTCGACGATG CATCGCGTCG TGACTTACTT
TCGCGCGTGG ATGAAAAGCG GCAGCAGGTG CGGAAAGCGG AGGCGCTCGC ACTGGGGTTG
AAAGTCAAAG TAACCACACT TGCGGAAATG GCCGGAAAAC CTTCCGTCAC CCGCGGAACT
CGCGTTCCGG TGGATGTCGC GATTGAGAAT GGCGGGTCGC AAGCGTTGCA GTTTCAGCTA
GCCGATGTCG AGGATCCTGC GGCCGCCAAG CGGGCGGTGA AGTCGCTTGC CATCGCGGCC
CATGGGCATT CAAGCGTGGT TGCCGAGAAC GTCGCGCGCG AGCCGCTCAC CAAGCCCGGG
TTCCACCGTG ATAATCCGGA GGTTGACTCG TTCTACGAAA GCTCGGACGC TGATCCCACG
ATCCCGTTCT CGGCCGGACC GCCGACCGTC TGGGTGTCTT ATCGCGCGGA GGGAAGCAAA
GCCGCGATCG ATGGCTCCCA GGAAGTTCCA GTAGAGACCG CGATCCATCA GCCTGACGGC
ACCACCCGCC TCCGCCCCCT GGCAATCGCT CCGAAGTTTT CGGTGCTAAT CGAGCCCAGC
ACCCAGACGG TTCCCACCAC CGCCAAAGAC GCCCGCAAGA TCGACGTCCA CGTGCGCAGT
ATCGCCGGCG GCCCTGCCAA AGGCACGCTG AAACTCCTCG TCCCCTACGC GTGGCGTGCG
ACGCCGCTGC AACAGGCCAT CGACTTCAGC AAGTACGGCG AAGAGAAGAC TGTCTCGTTC
CAGGTCACGC CCGGCGAACT CTCCGAGCAC AAGGCGAAGA TCGAAGCCGT GTTCGAGAGT
GAAGGGAAGA GGTACACCGA GGGCTACACC ACGGTGGAGC GCGAGGACCT CGGCACCTAC
TACTACTACC AGCCTTCGGT GCAGAAGATC AGCGAGGTCA AGGTCGCGAC GCAGCCGGGC
CTGAAGGTCG GCTACATCAT GGGCGCCGGT GACGACATCC CGACCGCGCT GCAGCAGATC
GGCGTGGACG TGGCAACGAT CACGCCTGAA GAACTCGCGA GCGGCGACCT CTCGAAGTAT
GGGACCATCG TCGTCGGCAT TCGCGCCTAC GACACGCGTG ACGATGTGAA GCAGCACAAC
GATCGCCTGC TCGAATTCGT GAAGAATGGC GGCACGCTCA TCGTGCAATA CAACGCCGGC
GTCGCGGACT TCAACGCCGT CAAGCTGCCG CCGCAGACCG GACCGGTGCC ACCCAAGATG
GACCCCAATT CGCCGATCCC CGGAAAGTAC GTGCCGTATC CGGCGGAGCT GAGCCGCGAC
CGAGTGAGCG TGGAAGACGC GAAGATCACG GTGCTCGAGC CGCAGAACGC GATCTTCCAT
TCGCCGAACG AGATCAAGCC CACCGACTTC GACGGCTGGG TGCAGGAGCG CGGCCTCTAC
TTCATGGACC GCTGGTCGCC GGAGTATCAC GCGCTGCTCG AGTGCCACGA CCCGAACGAG
CCCGAGCGCA AGGGCGGCCT GCTCGAAGCG AAGTACGGCA AGGGGACGTA CATCTACACC
GGCTACGCGT TCTTCCGGCA ATTGCCGGTG GGCGTGCCCG GCGCGGTGCG GCTGTTCGTC
AACCTGCTGA GCGCAGGGCA TGAGCAACAC TGA
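
The gene sequence is printed in space-separated blocks of 10 bases, 60 per row, so it has to be joined before any computation. The following is a minimal sketch of how the blocks could be cleaned and the GC fraction computed, for comparison with the reported GC content of 63%. The helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of characters in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGCAAACCT GTACAACTGC AAAGTTCATG AAATTGTCCC TGGATCGCAC ACGCCCGTTC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.1%}")
```

Passing the full 2793 bp sequence through `clean_sequence` and `gc_fraction` gives the value to compare against the GC content field.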
 
Protein sequence
MQTCTTAKFM KLSLDRTRPF VAGLLALCLA SPSALLSQSA APPAPEAKVV DELAPLPQDD 
GRLGLELLLK RLKTTARLMH TTAHPDDEDG GMLTLESRGK GYDVTLMTLT HGEGGQNKTG
SNLFDELGVL RVLELLESDK YYGVHQRFSM AADFGFSKSP QESLEKWKDG GKPGYIPLRD
MVRVIRTFRP DVIAQRFQGS ERDGHGHHQA SGIITKEAFR AAADPKQFPE LNLPPWQAKK
LYMDNVRDGE EYTVAFDTGA EDPALGMSYV QFAMKGLKHQ LSQGAGAWNV DPGPHITRYK
LIDSELPQPK AGEHEKDFFD GIDTSLPALA DRLGADENRV PWLKPGLEEV AKLIDQASEA
EKKDMESAAA PLSAAAQSLS ALRTRIENSP LDDASRRDLL SRVDEKRQQV RKAEALALGL
KVKVTTLAEM AGKPSVTRGT RVPVDVAIEN GGSQALQFQL ADVEDPAAAK RAVKSLAIAA
HGHSSVVAEN VAREPLTKPG FHRDNPEVDS FYESSDADPT IPFSAGPPTV WVSYRAEGSK
AAIDGSQEVP VETAIHQPDG TTRLRPLAIA PKFSVLIEPS TQTVPTTAKD ARKIDVHVRS
IAGGPAKGTL KLLVPYAWRA TPLQQAIDFS KYGEEKTVSF QVTPGELSEH KAKIEAVFES
EGKRYTEGYT TVEREDLGTY YYYQPSVQKI SEVKVATQPG LKVGYIMGAG DDIPTALQQI
GVDVATITPE ELASGDLSKY GTIVVGIRAY DTRDDVKQHN DRLLEFVKNG GTLIVQYNAG
VADFNAVKLP PQTGPVPPKM DPNSPIPGKY VPYPAELSRD RVSVEDAKIT VLEPQNAIFH
SPNEIKPTDF DGWVQERGLY FMDRWSPEYH ALLECHDPNE PERKGGLLEA KYGKGTYIYT
GYAFFRQLPV GVPGAVRLFV NLLSAGHEQH
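
The protein sequence corresponds to the gene sequence translated with translation table 11. As a hedged sketch, the code below builds the codon-to-amino-acid mapping in the standard NCBI ordering (bases T, C, A, G), which tables 1 and 11 share; the tables differ only in which codons may act as start codons, and that is not modelled here. The `translate` helper is hypothetical and stops at the first stop codon.

```python
from itertools import product

# Codon table in NCBI order (first, second, third base each cycling T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 33 bases of the gene sequence above.
print(translate("ATGCAAACCT GTACAACTGC AAAGTTCATG"))      # MQTCTTAKFM
print(translate("AACCTGCTGA GCGCAGGGCA TGAGCAACAC TGA"))  # NLLSAGHEQH
```

The two outputs match the first and last ten residues of the protein sequence listed above.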