Gene Acid345_3646 details


Gene Information

Locus tag: Acid345_3646
Symbol:
ID: 4072249
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4313889
End bp: 4315649
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 60%
IMG OID: 637985669
Product: hypothetical protein
Protein accession: YP_592721
Protein GI: 94970673
COG category:
COG ID:
TIGRFAM ID:
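
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions about how the record is built, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4313889
end_bp = 4315649
gene_length_bp = 1761
protein_length_aa = 586

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not translated.
expected_protein_length = codons - 1

print(span, codons, remainder, expected_protein_length)
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.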


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCAGAC TTGTTTCGGC ACTCTTCCCT GCAGTTTTCC TCGCGGTCCT TTCCGGTTGT 
GGCGGTTCAT CGAGTTCGCC GACACCCACG CCGAAGCAGT CGATTCAATA CGCGGTGAGC
CCTTCTTCCA TTGACTTGCA GCAGGATGGG ACGAAGAGCA CCGTTTCGGT CACGGCAACC
TGCCAGAACC TCGATACGGT TTCGGCCGAC ATCACAGGGC TGCCCACGGG CGTAACCGCG
AGCGTAACGC AACCGACGTG TTCCACGGCC GGGAGCATTG ACTTCACCGT TACGGATGTT
GCGAACGCGC GCGCAACGTC CTATACGGTA GTTGTTGGTA CTCCGAACGC CGCGAATGTC
ACGACCGCGA AGCTGGCCAT GAACGTCGTT GCGCAGGCCG CGGTGACGCG GACGGCGACA
GGCTTGAAGA CCGCGTTCAT GTCCACCTCG TTCCAACTCG CGGACTGGTC GTACTCGTGG
CTGAACGATC ATCCGGCGAC GATTCCGCCG CTCAACAATC TTGCAGAGCA GCACATCCGC
ATTCAACTGA TCGACGGCGC AGTACCGCAG ATCGACGCCG CGAATTGGGA TTTCACCAAG
GCGGATGCGA CCATCCAGCC GCTGCTTGCG GTGGGCGACG ATAGTCCTGA GCTGCAGATC
GGCACGGTGC CGGCCTTCCT GGGTGACAGC AGTGGCCACT ACGTGGAAGC GAACCTGCCG
GCATTTGCCG AGTACTGCGC GAACCTGGTG CGCTACTACA ACAAGGGCGG CTTCAGCGTG
GGCGGCAAGC TCTACAAGTC CTTGAGCAGC ACGCCGATCA AGTGGTGGGG GATTTTTAAC
GAGCCGAACT GGAACAGCGT GACTCCCGCG CAGTACCCCA CGATGTACAA CGCAGTCGCG
GCGTCGATGC TCGCGGTGGA TCCCGACATC AAGTTAGTCG GGCTGGAACT GGGCGATGTG
ACGGGCATGG CGCAGAGCTA CATGCCGCCG TTGTTGAGCG GCGTGACACA GCCGATGCAT
GCACTGGCCA GCCATTACTA CAGCACGTGC AACCAGAAAG ATTCCGACGT GCAGTTGTTC
TCACAGGTGC AGATGTTCCA CGACCAGACC GCTTACATCC GGACTACGCT GGATGCAAAT
GCGCCGACGG CCGGTCTGCC GATCTGGATA ACCGAGAACA ACGTGAATGC CGACTACGAC
AAAGGCGGCG GTATTAGCGC CTGTAATGGC GGTGCGTTCA CCGAAGATGA CCGCGGCACA
AGCGCATACT TCGCGGCGTG GCGTCCGTTC GTCTATTCCC AGATGGTGCA CGCCGGAGCG
GCGGGCATTT GGCATTGGTC GTTTTACGGC GGTGGGCAAT ACGGCGAGTA TGCCGACGAC
AGCACGCCGT ATCTCAGCTA CTGGGTGGAC TACGAACTTT CGCACCTGCT CGGACAAGAG
AGCATGACGG AGGTCAGCAG CAGCGTGACG GAGCCGAGTC GCATCGAGAT GTTCGCGGCG
AAGGCCGCGG ACGGATCCCG CGTGATCATG GTGGTGAACC ACGACGTGGC TGCCGATTCC
GACAACAACG GGCAGGGCGT GCCGAAAAAA GTGCAACTCG ATCTCAGTGG CGCCGGTTCG
TTTACAACGG CCACATTGAT CGCCGTCGAT AAGAGCACCA GCATAGCCAC TGGGCCTACG
ACGACGACGC TCACGCCAAC CGCCGGCGTC GTCACGCTGA CTTTCCCCGG CTACGGCGTG
CAGTTCGTAC AGCTCAAATA G
 
Protein sequence
MRRLVSALFP AVFLAVLSGC GGSSSSPTPT PKQSIQYAVS PSSIDLQQDG TKSTVSVTAT 
CQNLDTVSAD ITGLPTGVTA SVTQPTCSTA GSIDFTVTDV ANARATSYTV VVGTPNAANV
TTAKLAMNVV AQAAVTRTAT GLKTAFMSTS FQLADWSYSW LNDHPATIPP LNNLAEQHIR
IQLIDGAVPQ IDAANWDFTK ADATIQPLLA VGDDSPELQI GTVPAFLGDS SGHYVEANLP
AFAEYCANLV RYYNKGGFSV GGKLYKSLSS TPIKWWGIFN EPNWNSVTPA QYPTMYNAVA
ASMLAVDPDI KLVGLELGDV TGMAQSYMPP LLSGVTQPMH ALASHYYSTC NQKDSDVQLF
SQVQMFHDQT AYIRTTLDAN APTAGLPIWI TENNVNADYD KGGGISACNG GAFTEDDRGT
SAYFAAWRPF VYSQMVHAGA AGIWHWSFYG GGQYGEYADD STPYLSYWVD YELSHLLGQE
SMTEVSSSVT EPSRIEMFAA KAADGSRVIM VVNHDVAADS DNNGQGVPKK VQLDLSGAGS
FTTATLIAVD KSTSIATGPT TTTLTPTAGV VTLTFPGYGV QFVQLK
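
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own method. It relies on translation table 11 assigning the same amino acid to each codon as the standard genetic code (the two tables differ in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 maps codons to the same amino acids as this table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and final 21 bases of the gene sequence in this record.
first_block = "ATGCGCAGAC TTGTTTCGGC ACTCTTCCCT"
last_block = "CAGTTCGTAC AGCTCAAATA G"

print(translate(first_block))  # MRRLVSALFP
print(translate(last_block))   # QFVQLK
```

The two outputs match the first ten and last six residues of the protein sequence listed above, and the final codon TAG is a stop codon.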