Gene Acid345_2980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2980 
Symbol 
ID4068881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3528379 
End bp3531759 
Gene Length3381 bp 
Protein Length1126 aa 
Translation table11 
GC content56% 
IMG OID637984999 
Productintegrin-like protein 
Protein accessionYP_592055 
Protein GI94970007 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.16679 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCAAG TGGCTGCGGT TAGTTTCTTT TTAGTTTTCA CTGCGGTATT CAGCGCATAC 
TCTCAGGATC TCACTCAGGC AAAGCGCACG CCTGCGGCGT CTGGCTCGAC GCAACCCAAC
GTATTTCTCA CACCCAAACA ATATCCCGCT GGTCCTTCCG GTGTCACTTC GATCGCTAAG
GGCGACTTCA ATAACGATAG TTACATGGAT GTCGCGGTAA CTAACGTTTC TGGCACGATC
ACGGTTCTCC TGGGCAAAGG CGATGGCACT TTCCAAGCCC CCGTATCGTA CCCAGCCTTA
TCTTCCCCGG TCTCGATCGC CGCGGCAGAT TTGAATGGGG ACGGAAAATT AGACTTAGCG
GTAGCAAACA GCGGTAGTGG GAGCATTAGC GTATTCCTCG GAAATGGCGA CGGAACCTTC
CAATCGCACA CGGATGTTGC CGTCGGCACG AGCGTGCAAA TGTTGACCGT TGCAGACTTC
AACGGCGACG GCAAACCCGA CCTTGCGGTT TTAGTCGATG GCATGGTGAG CGTCCTGATC
GGGAAAGGCG ACGCCACCTT CAACGCGATA GGCGAGTACG CGAAACCATG CGCTACCTAT
TTGGCCACAG GGGATTTCAA CGGAGACGGC AAGACCGACA TCGTGGCAGG ACGTCAGTGC
GTCCTACTGG GCAATGGCGA CGGCACTTTT CAACCGCCTG TAGGTTCCCA GAAGATTGGG
AACACGGTGA GTACTGCAGT AGGCGACATC AATGGTGACG GCAAGCTCGA CTTGATAGAG
GGTGGTATCG GCGACTCGGA TGGAACTCCA AGGGCCCTGG TTGTCGTGCT CCTGGGCAAT
GGAGACGGAA CGTTTCAACC GCCCCAAGGC TTTTTTGGTT ATGGGAGCGG TGTACAGGGA
TTATTGCTTG CCGATGTGAA CGGCGATTCC CATCCAGACA TCGTGCTGTC CAGCTCTGAA
AACGTGGAAG TTGTAAACGG AAAGGGGGAC GGCACCTTTG AACCGGGTGT GCTCTACCCC
GTGGGGAACC GGCCGGTAGC AGGAGGGTTG GTGCTGTCCG ATTTTACCGG CAGCGGCAGG
CTTGATCTCG CAGTTTTGAC CTCGTGCGCT AACCCGTCTA TTTGTGGAGA CGGAGCCGTG
ACCCTATTGA GAGGCAAAGG GGACGGCACG TATGTTGCGC CAGCAAGTTA TTACATTTTT
GAGGGTGACG AACGTTTCGA CGCTGTAGGG GGATGGGTCG CGGTGGGTGA CTTCGACGGC
GACGGGAAAC TCGACGTACT CGAAGTATTC GACACTCGCG CCTTCATCTC TCTAGGAAAC
GGGGATGGCA CCGTGCAAAC GGGCCAACGT TATTATCAAG TTGCATATCA GTCCGATGGG
GCGGTAGTGG GAGATTTCAA CGGCGATGGC AAACTCGACG CTGCCATTCT ACACTCTTGC
GACGTCTTTT ATAGCGACCC TGGCCCAGGC AATCCTCCCC CCTATTGCGT TAGCGCAGGT
TCGGTAGGGG TGCTATTGGG AAATGGAAAC GGAACTTGGC AGCGGTGGCC GGAGAGCCTG
TACTTTGGAG TTGGAGACAC GCCTACATCT ATTGCGACGG GTGACTTCAA TCAAGATGGC
AAACTCGACC TTGTCGTTTC TGATGGAGCG AATGCTTATA TTCTCTTAGG AAACGGTGAT
GGCACTTTCC CGGTCCACCA AGCCTACCCA ACCGGAGCTG CGGCTGATTA CCTTAATCCG
TTTCTGCCCA ATGCGCAATC GGTCGTAGTT GGCGATTTCA ATGGCGACGG CGCTCCGGAC
GTCGCGGTTT CAAATTCAGA CGGGGGTATC GCCGTTTTGC TCGGCAACGG GGACGGCACG
CTCCGCGCCC CCCAACTCTT CCCAGCTATA AAGAGTTCGC AGTCGCTTGC GATCGGAGAT
CTTAACCGAG ATGGCAAATT GGACATCGTA GCTTCCGACG GGAGTGGCAG CATCAGTATT
TTCCTTGGTA ATGGCGATGG TACTTTCCAG ACCAGCAAAG TTTATGCGGC CGTTGGATCA
CAAAGTGTAA CGGTCGGAGA TTTTAACGGC GACGGCATCC TTGACGTAGC GAGTGGGACG
GGACACACCG TAAGCCTGCT TCTCGGAAAT GGCGATGGCA GTTTGCAGCC GCCGGTGAAT
TACATTGTTG GACTCAGTGC AACCGGGCTG GCAGCTGGAG ACTTCAATGG GGACGGGGCT
TTGGACCTCG TCACCGAAGA CTTCTCGATT CTATTGAATC GCCAGGGAAC GCAGCTTAAC
GTGCAATCTT CCCGCAATCC ATCCAACGTA GGCCAACCTG TGACTTTCAC CGTGACCTCC
GCTGCGAGTT TGCCGGAGAC AGGCTTGCCT TCGGGCACCA TCACGCTGCG CGATGGGAGC
ACTGTGCTGG GAGAATCCGG GGTATCGGGA GTATTTGACG TGAAGGTCTC CGGATTGACT
GCCGGAACGC ACCAAATCAC GGCCACATAC TCCGGCGATA ACAACTTTCA ACCACACACG
ACCGCCATCC TGACAGAACA CGTCGGTGCT CCCGCCACGA TGATTAGCCC CGCTCTTGGA
TCGATTCTAA CCAGCACTAC CGTAACCTTC GCGTGGAAAG CAGCCGCGGG TGCTTCGCAG
TACAGCCTGT ACCTCGGCAC TAAACCCGGT CGGGACGACC TCGGGTACGT CAATGCGCAT
TCGAGCACGT CCGCGACTGT GAAAAACCTC CCATCCACGG GATCCTCCCT ATACGTCACG
CTCTTCTCGC TTGTCGGAGG GGTGTACTAT TCAAATTCTT ATACGTATAT CCTCCCTGGA
ACACCTGCCA AGGCCAAGAT GACCTCGCCC CTGCCCGGCA CGATGTTAGT CGGCAAGGAT
GCGACATTTA CGTGGAGCCA CGGAACGGGC GTCACCTACT ACAGTCTGTA TGTCGGGACA
AAGGGTTACG GCACTCACGA TCTGGATTTC ATCAACGCCA CGACTACCAG TGCCAGCGTG
TCGAACCTCC CTGCCGACGG GAGCACAATC TACGTTCAAG TAAATTCGTA TATCGACGGC
GCGTGGACTA GCCAGAGCTA CACCTATATA AGTGGAAGTG GAACTCCCGC GCCCGCAACC
ATGATCTCGC CCACCCCTGG AAGCAGTATC TCCGGCAACT CTGCGACTTT CACATGGACG
AGCGGCGTTG GAGTCAGCGA ATTCAGTTTG TACGTAGGTA CGGGAGGGGT GGGTTCCCAT
AACATTGCGT TCATCGAAAC CGGAACCACG AGTGCGACCG TCACTGGCCT TCCCGCTACC
GGCGCAACGA TCTATGTGCG CTTGAATTCG TTTGTCAACG GCGCGTGGCA GTGGGTGGAC
TATTCTTATC GGAACCCGTA A
 
Protein sequence
MRQVAAVSFF LVFTAVFSAY SQDLTQAKRT PAASGSTQPN VFLTPKQYPA GPSGVTSIAK 
GDFNNDSYMD VAVTNVSGTI TVLLGKGDGT FQAPVSYPAL SSPVSIAAAD LNGDGKLDLA
VANSGSGSIS VFLGNGDGTF QSHTDVAVGT SVQMLTVADF NGDGKPDLAV LVDGMVSVLI
GKGDATFNAI GEYAKPCATY LATGDFNGDG KTDIVAGRQC VLLGNGDGTF QPPVGSQKIG
NTVSTAVGDI NGDGKLDLIE GGIGDSDGTP RALVVVLLGN GDGTFQPPQG FFGYGSGVQG
LLLADVNGDS HPDIVLSSSE NVEVVNGKGD GTFEPGVLYP VGNRPVAGGL VLSDFTGSGR
LDLAVLTSCA NPSICGDGAV TLLRGKGDGT YVAPASYYIF EGDERFDAVG GWVAVGDFDG
DGKLDVLEVF DTRAFISLGN GDGTVQTGQR YYQVAYQSDG AVVGDFNGDG KLDAAILHSC
DVFYSDPGPG NPPPYCVSAG SVGVLLGNGN GTWQRWPESL YFGVGDTPTS IATGDFNQDG
KLDLVVSDGA NAYILLGNGD GTFPVHQAYP TGAAADYLNP FLPNAQSVVV GDFNGDGAPD
VAVSNSDGGI AVLLGNGDGT LRAPQLFPAI KSSQSLAIGD LNRDGKLDIV ASDGSGSISI
FLGNGDGTFQ TSKVYAAVGS QSVTVGDFNG DGILDVASGT GHTVSLLLGN GDGSLQPPVN
YIVGLSATGL AAGDFNGDGA LDLVTEDFSI LLNRQGTQLN VQSSRNPSNV GQPVTFTVTS
AASLPETGLP SGTITLRDGS TVLGESGVSG VFDVKVSGLT AGTHQITATY SGDNNFQPHT
TAILTEHVGA PATMISPALG SILTSTTVTF AWKAAAGASQ YSLYLGTKPG RDDLGYVNAH
SSTSATVKNL PSTGSSLYVT LFSLVGGVYY SNSYTYILPG TPAKAKMTSP LPGTMLVGKD
ATFTWSHGTG VTYYSLYVGT KGYGTHDLDF INATTTSASV SNLPADGSTI YVQVNSYIDG
AWTSQSYTYI SGSGTPAPAT MISPTPGSSI SGNSATFTWT SGVGVSEFSL YVGTGGVGSH
NIAFIETGTT SATVTGLPAT GATIYVRLNS FVNGAWQWVD YSYRNP