Gene Acid345_2388 details


Gene Information

Locus tag: Acid345_2388
Symbol:
ID: 4071386
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2822091
End bp: 2825579
Gene Length: 3489 bp
Protein Length: 1162 aa
Translation table: 11
GC content: 57%
IMG OID: 637984404
Product: Cna B-type protein
Protein accession: YP_591463
Protein GI: 94969415
COG category:
COG ID:
TIGRFAM ID:
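
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `span_length` and `coding_codons` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2822091
end_bp = 2825579
gene_length_bp = 3489
protein_length_aa = 1162


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```

Under those assumptions, 2825579 - 2822091 + 1 gives 3489 bp, and 3489 / 3 gives 1163 codons, one of which is the stop codon, leaving 1162 amino acids.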


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.315243
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.974277
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTTTTA AGCGGATACT GAGTGTCCTG TTGCTCTGCT GGTTATGCAT GGCAGGGTTT 
TTGTTTGCGC AGTCGGACCG GGCCACGGTC ACGGGAGCTG CGCGTGATGC GTCCGGAGCG
GTATTGCCGG GCGTAAAAGT CGAAGTCACC AACATAGCCA CCAATTTGGT TTCAACCGCT
GTAACTAATA CGGACGGCAT TTATCGTATT TCGAACCTGC CCATCGGCAA CTACACGCTG
GTGTTTTCAC ACGAAGGGTT CAAGGCGGTC GAGCGAAAAG AAGTAACGCT CATGACCAGC
CAGATCGCTG AAATTAATGC AGTGTTGCAA ATCGGCGTGG CTACGGAGGT CGTGGAAGTC
ACGGGCGTCA CTCCGATGCT GCAATCGCAA GATGCGACGG TGTCGGCCAA CCTGGACGCC
AAAGCGGTCA CGGAGTTGCC GCTGAACGTG CAGGGAAGCC GCAACCTCTC GAATTTCATG
TTTGCTTATG TTCCGGGCGT TGAAGGCGGC GATTACAGCT CGCATATCAA CGGCAGCGTG
GCATTGAGCA AAGAAGTGAT GATTGATGGA ACCTCGGCTG TTTCGCAGTT GGGCGGCTAC
ATCAGCGAAT CGCAGCCACC GATGGAAGCA GTTCAGGAAT TCCAGGCGGA CACCGCGGGC
ATTGGGGCCG ATGCGGGCCG CAGCGGCGGC GGCGTGTTCC GGTATGAAAT GAAGTCGGGA
ACGAACGCAA TCCATGGGAG CCTATTTGGG TTTCTGCATT CGACATCCCT CGATGCGTTG
AGTGCCTCGA ACCGGTTGGC AGTGGTCACT GATCCGGCAA ATGCCGGCGC TTACCTGAAA
AAGAGCGATA GCCTCTCTGA CTGGGGCGGG AGCTTTGGCG GGGCAATCAT CAAGGACAAG
CTGTTTTATT TCGTCTCGTT CGAACGCTAT ATGCAGGCGA ACTGGGCGCT GGGCCCGAAT
TCGCGCACCG TTCCTACGGA CGCGATGATG GGTCTGAATG CTGACGGCAG CGTGGCGCAA
TACGCCGATC TGAGCAAGTT GCTGACGACG AGCGTAACCC ACGGCAACGA TCCCTGCGGC
AATGCGGTTT ATCGCGGATC GGTTTTCAAT CCCGCGACGA ACTGCGTGTT CGTGAACAAC
CAGATTCCAA CCAACCTAAT CAGCAAGACG TCGGCGCAGA TCCTGCAGCT TTATCACAAC
TATTACGCGC CTGAGTCGGA CCTGCCTACG AACAACGCGG GGAATGCCTA TAACCCCGAT
CCGTGGTTCC ACAACACGCA GACCAGCGTG AAGATCGACT ACAACTTCAG CGATAAGAAC
CACCTCAACG GTTCTTATTA CTATGACAAC TATCCGCGTA TCAACGCCGA CCAGGGCGGT
GCGTGGTCCG CGAACTCGCC GTACGGCGGA CCGATGGCGA ATGCCTATTG GCACAACACC
ACGGCGCCTT CGGTACGCTT GAGCGACAGC CACATGTTCA CGCCGAACGT GCTCAACGTG
GCGCATTTCA CGTGGAACCG CTTCCGCAAT CCGAGCATCG CGGTTTCGCA GTCGGACAAC
TGGGACAACA AGCTGGGCTT CTACGATGGC GCCGGCAACT TCCCGCTCAT CACCTACAGC
TCCGGCATGT ACACCAACTG GAGCCCGAAC GTTAATGGTT GGTACTACTC CAACCTGGGC
AGCCAGTTCA ACGACTACTA CGCAGCGAAC ACCTATATCT ACAACGATGA AGTTGCATGG
ACCAAGGGTC GCCACGCGAT GAAGTTCGGC GCGGAGTTCC GGGCGATGCA GTTCAACAGT
CATCCGGATT CCGAGACGTT CAACAACATA ACGTTTGACC CAACCTCCAC CGCGTCCTCA
ACTTGGTACA ACTACGGCTG GAACGAAGTC GGCAGCTCGT TCGCGAGCTT CCTGCTGGGC
GATGTCTACC AGGCCACGGG AACCGCAGTG GATCCGCAGT ATGGACGTCG CAAAGCGTTC
TCGCTTTATG CGATGGACGA CTTCCGCGTC AACGACAGGC TGACCGTTAA CATGAGCCTG
CGCTGGGATT ACAACAGTCC ATATAAAGAG AAATACGGTC ACTGGTCGAG CTGGGTGACC
GACGCGATGA ATCAAGTGTC TGGCACAATG GGCCAGTATC AGTACCTGAC GAGGGGAGAC
GAGTCCTTCG AAAAACGGCA GCAGTGGGCG AACTTCGGCC CGCATGTGGG CGCAGCCTAC
AAGATCAACG AGAAGACGGT TGTTCGCGGC AGCGTCTCGG TGTTCTTCGT ACCGCTGAAC
ATGAATACCT GGGGCGGCAT TCCTTACCAG CAGACCGGCA ACCCTGGTTA CTACCAACAC
AGCATTCAAC AGAACTTCAA CTGGGACAAC GGCTACCAGC CAGTGTTGTC GCAGGTGCAG
AAGCCTGACT ACACGCAGTG GGGCGTGGTC ATGATCGATC CGCGGTCGCT GACTCCGGGT
AACACGCAGC AGTACCAGAT CGGCGTGCAA CGCGAGTTAA CGCGTGACAC GAAGCTGGAA
GCGGTGTGGA TCCAGAGCCA CAGCTACCAT CTGCAAAGCG GTACGGTGAA CACCAACCAG
CCAACTGTGG AGAACATGCA GAACTACGTC CTCCACGGAG AATTTCCGGC CGACTACAAC
CACTATTGGG ATGCGGGTGG TCCGGGATGG CAGGGAATCA CGCCTTATCC GCAGATCGCG
GTGGGATATG GTCCGATGTT CTCGGTGGGA TCTCCGCTCG GCAACTCCGA TTACAAGAGC
TTCCAGGCGA GCGTGACCAA GCGCGCGTCG AAGGGGCTCT CGCTGATGGG CAGCTACAAC
TGGTCTCAGG CACACGGCGA CGTGGATTCG AGCATGGGCG AGTTATGGTG GGCGGGCGCC
ATTCAAAACG TGTACGACCT GAAGAACGAG GCGAAGGACA TCGCCGGGTT CGACATGACG
CACATCGTGA AGGGCTATGT CATCTATGAC CTACCATTCG GTCACGGACA AGCGTTCGGC
GGGAATGTGA GCACGCCGGT CGACTACCTG ATCGGCGGAT GGAGCTTGAA CGGCAGCTTC
CACTACAACA CGGGCACGCC GATCTCGGTG CACTCGACCA ACTCGTATCC TGGCTACAAC
GCGGTGTACG TGAACATGGT TGCGGGTTGC GATCCGACGA ACGGCAGCGC GAAGTTGTAT
AAACAGTGGC TGAACGCAGC TTGCTTCGCA AATCCGGCCA ATGCAGAGCT AGGCACGGCC
GGCAACTTCC AGGACTTCCT GCGGAATCCT GGGCTGGCAA CCGAAGATAT CGGTCTGCAT
AAAGGACTCG CGTTCGGACC GGACGGACGG TACAACTTCA CCTTCCGCCT GGAATTCTTC
AACATCTTCA ACCGTCACCA GTTTGCTGGT CCCGATACGA ACCTCGGCAG CCCCACGTTT
GGCCAGATTA CCGGCTATAC CGGGTTCGGA GGACGTACTG GCCAGTTTGG TGCGCGCTTT
ACCTTCTAG
 
Protein sequence
MGFKRILSVL LLCWLCMAGF LFAQSDRATV TGAARDASGA VLPGVKVEVT NIATNLVSTA 
VTNTDGIYRI SNLPIGNYTL VFSHEGFKAV ERKEVTLMTS QIAEINAVLQ IGVATEVVEV
TGVTPMLQSQ DATVSANLDA KAVTELPLNV QGSRNLSNFM FAYVPGVEGG DYSSHINGSV
ALSKEVMIDG TSAVSQLGGY ISESQPPMEA VQEFQADTAG IGADAGRSGG GVFRYEMKSG
TNAIHGSLFG FLHSTSLDAL SASNRLAVVT DPANAGAYLK KSDSLSDWGG SFGGAIIKDK
LFYFVSFERY MQANWALGPN SRTVPTDAMM GLNADGSVAQ YADLSKLLTT SVTHGNDPCG
NAVYRGSVFN PATNCVFVNN QIPTNLISKT SAQILQLYHN YYAPESDLPT NNAGNAYNPD
PWFHNTQTSV KIDYNFSDKN HLNGSYYYDN YPRINADQGG AWSANSPYGG PMANAYWHNT
TAPSVRLSDS HMFTPNVLNV AHFTWNRFRN PSIAVSQSDN WDNKLGFYDG AGNFPLITYS
SGMYTNWSPN VNGWYYSNLG SQFNDYYAAN TYIYNDEVAW TKGRHAMKFG AEFRAMQFNS
HPDSETFNNI TFDPTSTASS TWYNYGWNEV GSSFASFLLG DVYQATGTAV DPQYGRRKAF
SLYAMDDFRV NDRLTVNMSL RWDYNSPYKE KYGHWSSWVT DAMNQVSGTM GQYQYLTRGD
ESFEKRQQWA NFGPHVGAAY KINEKTVVRG SVSVFFVPLN MNTWGGIPYQ QTGNPGYYQH
SIQQNFNWDN GYQPVLSQVQ KPDYTQWGVV MIDPRSLTPG NTQQYQIGVQ RELTRDTKLE
AVWIQSHSYH LQSGTVNTNQ PTVENMQNYV LHGEFPADYN HYWDAGGPGW QGITPYPQIA
VGYGPMFSVG SPLGNSDYKS FQASVTKRAS KGLSLMGSYN WSQAHGDVDS SMGELWWAGA
IQNVYDLKNE AKDIAGFDMT HIVKGYVIYD LPFGHGQAFG GNVSTPVDYL IGGWSLNGSF
HYNTGTPISV HSTNSYPGYN AVYVNMVAGC DPTNGSAKLY KQWLNAACFA NPANAELGTA
GNFQDFLRNP GLATEDIGLH KGLAFGPDGR YNFTFRLEFF NIFNRHQFAG PDTNLGSPTF
GQITGYTGFG GRTGQFGARF TF
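
The sequences in this record are displayed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction and a basic CDS sanity check. The helper names (`clean_sequence`, `gc_fraction`, `looks_like_cds`) are hypothetical, and the stop codon set is the one used by translation table 11.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def looks_like_cds(dna: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


# Illustration on the first two blocks of the gene sequence only;
# paste the full gene sequence text here to analyse the whole CDS.
fragment = clean_sequence("ATGGGTTTTA AGCGGATACT")
print(len(fragment), gc_fraction(fragment))
```

Applied to the full gene sequence, `gc_fraction` can be compared with the GC content field and `looks_like_cds` with the stated gene length; the final codon shown in the record is TAG.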