Gene Acid345_2949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2949 
Symbol 
ID4070873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3493710 
End bp3495617 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content57% 
IMG OID637984968 
Producthypothetical protein 
Protein accessionYP_592024 
Protein GI94969976 
COG category[S] Function unknown 
COG ID[COG4805] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTCG CCCGAATACT CCTCTGCGTC TTCTTTGTAT CCTGCTGCTC ATTCCTCTTC 
GCGCAGTCCC ACAAAACCAC ACCCGAGGCG CCTGTGACCG GAACTCCGAG CGAACGGCTG
GCCAAGCTCT CCGAACAGTT CCTTCATGAA TCACTTCAAC TTTCACCGGT ATCAGCTTCT
GGGGCGGGAT ACCACACGTA TGTCGATCCG ACGAGCGGAC GCACGGTTCG CCTTGATGCC
GAGCTCGATG AGATGGGTAC CGAAGACCTC GCGGAACAGT TGAAGTTCTA TCGCCACTGG
CGCGAGCGGT TGCGGAGCCA GGCGCCGTAC AAAAGCCTCG ACGCTCAGGG CCATGCCGAC
TGGATTCTTC TGGACGACGG AATCTCGAAC AACTTGCTCG AATTGGAGAA AGTCCAGAAC
TACAAGCACA ATCCGACCGG CTGGGTGGAG TTGATCGGCA ACGGGTTGTT CCTCAACATG
TCGCAGGAAT ATGCGCCAAA AGATCAGCGC ATGGCAGATG CAGTGTCGCG CATCGCGCAG
ATTGCGCGCT TCATTGGGCA AGCTAAGCAA CAACTCATGG ACTCAGATCC GATCTACATC
AAAGTTGCGG TGGAAGAAAA CAGCGGCAAT CTCGGGATGA TTGATGACAT CGGCAAAGAG
CTCCCGGCGA GTGGAGCGGT GCGTCAGAAG TACGATCGCT TCGCACCGGC GGCAAAGAAG
GCGCTGACGG ATTTCTCGCA GTGGATGCAG ACCGACTTGG CGAACCGTCC GACGAACGGC
CGCAACTGGC GGTTGGGGAA GGAGTGGTAT GCCGAGCGTT TCCGCCTGGT GATGGAGACG
AACGTTACTC CGGACGTGCT ACTCACCGAT GCCGAAACGG ATATGACGAG CGTCCGGGCG
GAGATGCTGG AAATTGCGGT ACCGATGCAC AAGGACATGT ATCCAGACCA CACCGATCAC
GCCGACCTGA GCGGAGTGGA TCGCGAAAAC AAAATTATTG GCGAGGTCCT GGACCGCCTG
GGGCAGGAAC ATCCGCAGCG CGATCAGTTG ATGGACTACA TTCAGGGCGA TCTCCAGAAC
ATTATTGATT TCATTCGCGA ACACAAGATC GTCGCACTGA GTGCGCGAAA CAATCTGAAG
GTGGTTGCAA CTCCGGACTT CATGCGCGGC GTTTATTCGG TGGCAGGTTT CCACGCGCCG
CCGCCGCTTG ATCCCAATAC CCAGGCGCAG TACTGGGTCA CCCCGATCGA TCCTAAGACG
GCGGATGAAA AGGCCGAGTC GAAGCTGCGC GAGTACAACA ACTACACGCT GCACTGGCTG
ACCATTCACG AAGCGCTTCC GGGACATTAC ATCCAATTCG AGCACGCGAA TAACGTGGAG
CCTCCGATGC GCAGGTTATT GCGCGCGTAT TACGGCAACG GCCCGTACGT GGAAGGCTGG
GCCGAGTACA TTGCGGGCAT CATGCTCGAC GCTGGGTTTG CTGACAACGA TCCGCGTTTC
CGGCTGATCA TGAAGAAGAT TCGTCTGCGC GTGTTGGCTA ACACAATCCT GGACATCCGC
ATGCACACAA TGGATATGAG CGACGACGAA GCCATGTCGC TCATGACCAA GCAGGCCTTT
CAGACTGACG CAGAAGCTCA AGGAAAACTT CAACGTGCAA AGCTAACTGC AACGCAGCTT
CCGACCTACT ACGTAGGCAT CCGCGGCTGG AACGATCTGC GGGCGAAGTA CAAGAAGGCG
AAGGGAACGG CATTTACGAA TCTGGAATTT CACAACCGGG CGTTGGATCT CGGTCCAGTG
CCTCTGCCGC TGGCAGGTGA GATTCTTCTG GGGATTCCGG CCAACTTGAG TGTGGGGCAG
AGCACCAGCG CTGCCCCGGC ACACAAAAGG GCGACGCGCA AAAAGTAG
 
Protein sequence
MKVARILLCV FFVSCCSFLF AQSHKTTPEA PVTGTPSERL AKLSEQFLHE SLQLSPVSAS 
GAGYHTYVDP TSGRTVRLDA ELDEMGTEDL AEQLKFYRHW RERLRSQAPY KSLDAQGHAD
WILLDDGISN NLLELEKVQN YKHNPTGWVE LIGNGLFLNM SQEYAPKDQR MADAVSRIAQ
IARFIGQAKQ QLMDSDPIYI KVAVEENSGN LGMIDDIGKE LPASGAVRQK YDRFAPAAKK
ALTDFSQWMQ TDLANRPTNG RNWRLGKEWY AERFRLVMET NVTPDVLLTD AETDMTSVRA
EMLEIAVPMH KDMYPDHTDH ADLSGVDREN KIIGEVLDRL GQEHPQRDQL MDYIQGDLQN
IIDFIREHKI VALSARNNLK VVATPDFMRG VYSVAGFHAP PPLDPNTQAQ YWVTPIDPKT
ADEKAESKLR EYNNYTLHWL TIHEALPGHY IQFEHANNVE PPMRRLLRAY YGNGPYVEGW
AEYIAGIMLD AGFADNDPRF RLIMKKIRLR VLANTILDIR MHTMDMSDDE AMSLMTKQAF
QTDAEAQGKL QRAKLTATQL PTYYVGIRGW NDLRAKYKKA KGTAFTNLEF HNRALDLGPV
PLPLAGEILL GIPANLSVGQ STSAAPAHKR ATRKK