Gene Acid345_0460 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0460 
Symbol 
ID4069455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp554601 
End bp557975 
Gene Length3375 bp 
Protein Length1124 aa 
Translation table11 
GC content57% 
IMG OID637982464 
Productintegrin-like protein 
Protein accessionYP_589539 
Protein GI94967491 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.849275 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAATG GCGTGCGTAC TCATTTGCCC GCGCAACCGA GCTTCAAGGG AAAGATATTT 
CCCTTCAATC GCCCACCCTC GTCAGATTCA TCGGAGAGAA TTCGTAACTC CGCATTGGCC
CGAGCGAGTA GTTCAACCGC GTCCATGCCA GGACTACTTG GCGCACCTTA TGTCCCCGCA
ACGATCGACA GCGACACTTC TGGCGTGTAC TCATCGGCGT CCACTGATGT CGATGGCAAG
AGTGGAATCG ATCTGGTGAC CGTGGACTAT GACGGAACCT TGAACGTGCT TCTGAATGAC
GGTCATGGGA AGCTGGCTCT GTCCTCCAGC AACACCTCCG CCGTGGCGCT CAATCCAGGC
ATCGTTTACA TCGAAGTCGC TGACCTCAAC CACGACGGAC ATCCCGACGT GGTTGCGATG
GATGCCTCAA ACAGCGCCTT CCTCGTGTTT CTGAACAATG GCGACGGAAC GTTTGGCGAT
GCGTCGAGCG TGTCGGTGGC TCCTGCAAGT GGCGCCAACT ATTTGAACGG GGGCTCCTTC
GCGGTTGGCG ATGTCAATGG TGACGGCATC CCAGACGTCG TGGCCATCTC GAACCTTCAA
AACGGAAGCT ATCCCACCTA CACCACGGTT TTCTCGCAGC AGACTTTCCT TGGAAATGGC
GACGGGACTT TCGCTGTTCC TGGGAGCAGT ACGGACGTGA CCTTGCCGGG TTTCCAGTTC
GTGCAATACG GGCAGGGAGT CGTCATCGCT GATCTGAACG GCGATAAGAA GGCCGATATT
GTGACCGAGT ATTACGACGC GATCAATTAC TGGGTGACCG TGGCGGGTAG CCTAGGTAAC
GGGGACGGCT CATTCGCCGC GATCGGATCC GGTTCGGCAG TGCGCGCTGT TCTCGGCATT
CATTCGGCTT TGCGCGTTCT TGATCTCAAT GGCGACAAGA ATCCAGACGC GGTCTTCATG
GTTGGTGATG GGAACGCGTA TGTTGCCCTC GGAAACGGCG ACGGTACCTT CCGCTCCTCA
AACACCGTAC TGCATGGTCT TCCGGCGGCC CAGGCATTTC AACTGGGCGA CTTCAACGCC
GACGGTTATC TCGATTTACT TGTCTTCAAC ATCGGCAGCC AAGCCGTCTA TGCGGGGCGC
GGCGATGGGA CGTTTAATCC AGAGCCAAAA GCTCAGTACG GTGGAAACCA GGCTGGCACT
CAGGAGCCCT CACCCGCGGA CTTTGACGGC GATGGCTTCG TCGACTTCGT GATTGTGGAC
CCCGCGTATG GAACCGCGTC GCTCTACTAC GGAAATGGCG ACGGAACGTT TGTCGCCCAG
CGTGCGATTG CTCCGTCGAA CTCATCTTCG ACGATCCCAA ACGCGCAGGA GGCTCCGAGC
AACTTTGTCG CAGTGGCGAC CGGCGACGTC AACAACGATG GCCTGGCCGA TGTGCTCGCT
TACGACTACA CGAGTCCGTT TTACACCGAC TTTCGCATGT ACCCGGATGT GGTGCTTGGC
ATCGGCGATG GCAAAGGTGG GTTCAGCTTC ACAAAGCTGG TGTCGGGTTA CCAGTTGTGG
ATTGCAAAGG GGTTCGCGGT GGCCCCCATT GTTTCCGACT TCGACCACGA CGGAAAGTTC
GATTTGGTGC TTGAGACTGA CACTGGCCTG TCGATTGCAC TGGGGAACGC GGATGGCACG
ATGGGAGCGT TCAAGGCGAT CTCGCTGCCC GTCGCGAGTT CCGGCTGTTC TTTCGGAAAC
TTCGACGTCG GCAATATCGA TAACGACTCG AACGGAAACC TCGACATGTT GATCGCATAC
AACGGTGAAG GTCCGAATTG TATCGGCAAC GACACCCCCT CCGGCTACTT CGTCCTGCTT
GGCGATGGCA AAGGCGGTTT CGATGCCGCG TTTACCCCAT ATGGCGGATA CGTGTCTGAG
ACCAAATTGA TCGACTTCGA CGGAGATGGC ATTCTCGATA TGGCGGTCGA CGACGGTTAC
CAGACCGGCA TCCTCATCTG CAATAGTTGC CCCACGCCCA TCACGGTTCT GAAAGGACAC
GGCGACGGCA CCTTTGATAC CGACGGCGCG CACCTGGTGC TGGATGGTTG GCGCATTTCA
TGGCTAGTTG TGGGTGACTA CGATCACGAT GGCAAGCAGG ACCTGACGCT CTTGTCGCGA
GGCCAGCGCG ATGATGCGTC GGGTGGCCTC GTGGACGATA CCGAAGGCAT ACTCCTGCTA
AAAGGCAACG GCGATTTTAC GTTTAAAGAT CCGGTGCTAA TTGCGAAAGA TACTTTTGCG
ACCCAGGCGC GATACGTCGA TTTAAATGGC GATGGGTTTC CTGACCTCAT CTTTGCACTC
GACATGTCAT TCGAAGCGGT GCCAACTTAC AGCGGCTTGG TATTGATGCA GAATCTTGGC
GACGGCTCTT TCTCCGAGCC GATGAACTAC CTTGAACCAT TCGGTACCGA GCAAATCTCC
ATCGCTGACT TTAACGGCGA TGGCGCGCCC GATGTGTTGG TAGCGCCCTA TTTCTACGAC
GGATTCACTC CCTCTGTGCT GTTCTTGAAC CAGGGTGGAG ACTCCCTGAG CTTGACAGCA
ACACCGAACT CCGTGACGCA GGGCGAAGAC GTGGTGCTGC GCGCTACGTT GAACGCCGTT
TTAAGTGGAG CCTCTGGCTC TGTAACGTTC ACGTCGAACG GCGCGGTGAT CGGAACCGCT
GAGTTGAGCA CAGGTTCGGT CGCCGAACTT ACTACATCGG AGTTACCCGC AGGAACCGAC
ACCGTGACGG CCACCTACGC CGGAGATGCG AACCATAACG CAGCGACCTC GGCGGTCGCG
AGTGTCTCGG TGCAAAGCTT GGCGCCTTCG TTCAGCATCA ACGCCACTCC ACCCTCACTG
AATTTGAAGG CTGGGCAATC TGGATCGGTG ACGTTGAACC TTGCGGCGAA CGCCATGTTC
AGTGGATCAG TCAGCTTTAC CTGTTCCGGG CTGCCTGCCA GTGCGAGCTG CTCATTCTCG
CCGGCTTCGG TGGCACTCGA TAGCGGAAAA ACTGGCACCG TTACTCTACA AATAAAGACA
GCTGGCACCA GTCAGGCGCG CGCAATTACC AACCGCTCGC TGTGGACTTC TGCTGCGGGA
CTCGCGCTGA TGCAGTGCCT GTTGCTGGTG CTGCCGATCC GGCGTCGCAA GCTCGCACGA
TTGTTCGCGA GATACTCCGC GGGCCTGCTC TGCCTTCTGA TGTTGGTTGG CATTTCTGCG
TGCGGCGGGA GCAGCCATGA CAGTTCAGGT GGAACACCAA CCGGCAATTC GACAATCACT
GTTACAGGGA GCGCCAGTTC TGGCTCCACG ACAGTGACGC AGACTGCCTT GATTTCTGTC
TCAGTAACCA ACTAG
 
Protein sequence
MRNGVRTHLP AQPSFKGKIF PFNRPPSSDS SERIRNSALA RASSSTASMP GLLGAPYVPA 
TIDSDTSGVY SSASTDVDGK SGIDLVTVDY DGTLNVLLND GHGKLALSSS NTSAVALNPG
IVYIEVADLN HDGHPDVVAM DASNSAFLVF LNNGDGTFGD ASSVSVAPAS GANYLNGGSF
AVGDVNGDGI PDVVAISNLQ NGSYPTYTTV FSQQTFLGNG DGTFAVPGSS TDVTLPGFQF
VQYGQGVVIA DLNGDKKADI VTEYYDAINY WVTVAGSLGN GDGSFAAIGS GSAVRAVLGI
HSALRVLDLN GDKNPDAVFM VGDGNAYVAL GNGDGTFRSS NTVLHGLPAA QAFQLGDFNA
DGYLDLLVFN IGSQAVYAGR GDGTFNPEPK AQYGGNQAGT QEPSPADFDG DGFVDFVIVD
PAYGTASLYY GNGDGTFVAQ RAIAPSNSSS TIPNAQEAPS NFVAVATGDV NNDGLADVLA
YDYTSPFYTD FRMYPDVVLG IGDGKGGFSF TKLVSGYQLW IAKGFAVAPI VSDFDHDGKF
DLVLETDTGL SIALGNADGT MGAFKAISLP VASSGCSFGN FDVGNIDNDS NGNLDMLIAY
NGEGPNCIGN DTPSGYFVLL GDGKGGFDAA FTPYGGYVSE TKLIDFDGDG ILDMAVDDGY
QTGILICNSC PTPITVLKGH GDGTFDTDGA HLVLDGWRIS WLVVGDYDHD GKQDLTLLSR
GQRDDASGGL VDDTEGILLL KGNGDFTFKD PVLIAKDTFA TQARYVDLNG DGFPDLIFAL
DMSFEAVPTY SGLVLMQNLG DGSFSEPMNY LEPFGTEQIS IADFNGDGAP DVLVAPYFYD
GFTPSVLFLN QGGDSLSLTA TPNSVTQGED VVLRATLNAV LSGASGSVTF TSNGAVIGTA
ELSTGSVAEL TTSELPAGTD TVTATYAGDA NHNAATSAVA SVSVQSLAPS FSINATPPSL
NLKAGQSGSV TLNLAANAMF SGSVSFTCSG LPASASCSFS PASVALDSGK TGTVTLQIKT
AGTSQARAIT NRSLWTSAAG LALMQCLLLV LPIRRRKLAR LFARYSAGLL CLLMLVGISA
CGGSSHDSSG GTPTGNSTIT VTGSASSGST TVTQTALISV SVTN