Gene Acid345_0385 details


Gene Information

Locus tag: Acid345_0385
Symbol:
ID: 4069207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 440346
End bp: 443270
Gene Length: 2925 bp
Protein Length: 974 aa
Translation table: 11
GC content: 59%
IMG OID: 637982388
Product: integrin like protein
Protein accession: YP_589464
Protein GI: 94967416
COG category:
COG ID:
TIGRFAM ID:
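
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, but both reproduce its numbers. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(440346, 443270)
print(length, "bp")                   # 2925 bp
print(protein_length(length), "aa")   # 974 aa
```

Both results match the Gene Length and Protein Length fields of this record.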


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.459486
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTAAGT ACCTGGTCTC GAAAGCCTAC ACAGTTACCG GTTTACCCAA AGTCATCCAG 
CGCGGCATTG CGGCCGCGGT GTGGTTGGGA TTGATAGGGT CAGGTTGCTC TGCGTTTGCG
AGCACTGCGA CGACTACCAC ACTTGCGATC ACTTCGGGTG GCACGGCCGT TACGACGATC
AGTGCAGGTA GCCAGGTAAC CCTTACCGCC ACGGTGCTTG CAGGATCGAC GGCGGTGAAG
CAAGGACAGG TAAATTTCTG CGATGGCGCA GCGACGCACT GTTCGGATAT CCATGTTCTA
GGCACGGCGC AACTCAATAG TTCCGGCAAA GCGAAGATCA CATTGCGTCC GGCACCGGGT
TCGTTTAGCT ACAAGGCAGT TTTCGTAGGA ACTCCGAAGA CAACTACAGC GTATGCCGGC
AGTACTTCGT CAACGGCCAG CCTGACTGTG ACGGGAAAGC TGCCGAGTTT TACCACGATT
GCTGAGCTCG GTCCTTCGGT TGGCTACACG CTGACCGCGA CGACGTACGG ATTCACTAAA
GCAAAGACGG CTACCGCGCC GACGGGAACG ATTTCGTTTG TGGATACGAC GACGGGAAAC
TCCGTGCTGA CATCCGCAGC TTTGGCCACT CCGCAAGCAG CGGTGAACTG GGTGAATTCG
TACACGTCGA CGCTCGGCTA TCTACCGAAC GCGATGGTGG GAGCGGACTT CAACGGAGAT
GGCTATCCTG ATATCGCTGT GGAGTTGAGC AACACGGCGA ATCCGGTGGG GATTTATCTT
GGGAATGGCA CCGGCAATTT CAATGAAGTG ACGAAGAGCC CGATTATCGC GGCGGGTATT
CCGGTGCTGG CGCAGGATTT CAATGGAGAC GGGATTCCGG ATTTGGTGCT GTGCCACGGG
CACAATGACT CGCTGACCGT TCTTCTCGGC AACGGCGATG GGACTTTCAC GGAAGCGCCT
GCAAATCCTT TTGGCGATGG GCTCGGGATT CCACCCGTAG TAGTGGCCGA CTTCAATGGT
GACGGCATTC CGGACCTTGC TACGGGCGGC GCGGGGTCCC TCAGTGTATT TCTGGCAAAC
GGCGCCGGGG CGTTCACGCA GGTGCCGACG ACGTCGAAAA CGCTAATCCT GGGCAACTTC
GCGACGATGG TGGCGGGCGA CTTCAACGGC GACGGCATCA CTGATATCGC GGCACTCGAT
GCGACGTTCA GCGAGACGGT CCGCGTCTAT TTCGGGTCTG GCGATGGGAC GTTCACGACG
GGCCCGACGA ACATGGTCAG TCCGGGCGGA TCCGCTGGAG CACCAATGGT GATGGTTACA
GCCGACTTCA ATGGCGATGG GAAGGCGGAC GTCGCCGTGC CTCTATGGAA TGGAGGGGTG
GCGGTACTCC TTGGCAACGG AGACGGAACC TTTCAGGAAG CGAGCGGGAG TCCCATCGGC
TTGGGAGATT ACACGCTGCA GGTCGGGCTC GCTGATTTCA ACGGAGATGG TGTCCCGGAC
CTGATGCTCC AACAGGAATC GAACATAACC AACGCATACG CACTGCTGGG GAAAGGCGAC
GGCACGTTTA CCGTGAGTTC GAACCCAGCA CCGTACCTGC CATGTTGCGG GATTGCCCTG
TTGATGGACG TGAATGGCGA CGGGTTGACT GATGTTGTGA ATTCGTCGCA GTACGATGGC
ACCGCGAGCG TGCTGCTGAC ATCAGCGCAA CAGGCGACGA CGCAAGTGAC CGGGATTTCC
GTGGGCGGCA CGAGCCCACA TAACGTTGTC GCCAAGTACC CGGGAGACAC GAAATACCTG
GCGAGCATCT CCGCGCCAAC GGAACTGCAG CCTCCGGCGG CGGCTCCGGT ATTCACGCCG
GCATCAGGCT CGATCCGGCC CTACATGGAT TCGATCAAGC TGACGTCCAG CACGCCGGGG
GCGACGATCT ACTACCTGGC GGTGGGTGCT ATTGATACGG GCGGAAGCTA CGTTACGTAT
AACGGGCCTA TCTACGGCTA CAACATGGGA TCAGCGACGA TCCATGCGTA CGCACTGGCG
CCGCCCAACT ACGGCCAGAG CGCGACTGTG ACTGCGACGT TCAATGTGGT GGGTATCCCC
GCGGCTATGA CCAGCCCTGT TCCCGGCTCC CCGCTCACAG GGTCGAGCGC GACGTTCACC
TGGGACACTG GCATCGGGGG ATCGCAGTAC AGTCTGTATC TCGGCAGCAC ACCGGGCGCA
CACGATATTG CGTATATAAG CGCAGGAACG AACACGACCG CAACGGTAAC GGGGCTGCCC
ACGAACGGCG AACTGTTGTA CGTGACTCTG TACTCGTGGA TGGGGACTAA GTGGCAGTCC
AACGCGTACA CCTACGTCAC GTCCGGCAAG GGCACGGCCG GAACGATGAC CTCGCCGGCC
AACGGTTCCA GGATGACGGG CGGGACGCAG GCATTCAGTT GGACGAAAGG AACGGGAACC
GACGGGTATT CGTTGTATGT CGGCAAGACG GCAGGGAGCC ACGAGATCGC TTACGTGAAT
GCAGGGCGGG CGACAACTAC GTCGGTTAAC GGGCTGCCGA CGAACGGCGA AGAGTTCTAC
GTGACGCTGA ACTCGCTCAA CGGCAAGACG TGGTTGCAGA ATACGTACCA CTACTACGCG
TCGGGTAGCG GGACGGCAGC GGTTATGACG TCGCCTGCCA ATGGGACGAC CTTGGCGAGT
AGCACGGTGA CATTTAGCTG GACAGCGGGG ACGGGGATCA ACGAGTACTC GCTGTACATT
GGAACAAAGC CCGGGGCACA TGACCTCGCA TTTGTGAACG CCGGCAGCGC AACGACTAAG
ACGGTGAGCG GGCTCCCGAC GAACGGGAGC AAGGTGTACG TGACCTTGTA CTCGCGCAAT
GGGACGAAGT GGCTGTCGAA CAGCTATTCG TATACCGCCA AATAA
 
Protein sequence
MLKYLVSKAY TVTGLPKVIQ RGIAAAVWLG LIGSGCSAFA STATTTTLAI TSGGTAVTTI 
SAGSQVTLTA TVLAGSTAVK QGQVNFCDGA ATHCSDIHVL GTAQLNSSGK AKITLRPAPG
SFSYKAVFVG TPKTTTAYAG STSSTASLTV TGKLPSFTTI AELGPSVGYT LTATTYGFTK
AKTATAPTGT ISFVDTTTGN SVLTSAALAT PQAAVNWVNS YTSTLGYLPN AMVGADFNGD
GYPDIAVELS NTANPVGIYL GNGTGNFNEV TKSPIIAAGI PVLAQDFNGD GIPDLVLCHG
HNDSLTVLLG NGDGTFTEAP ANPFGDGLGI PPVVVADFNG DGIPDLATGG AGSLSVFLAN
GAGAFTQVPT TSKTLILGNF ATMVAGDFNG DGITDIAALD ATFSETVRVY FGSGDGTFTT
GPTNMVSPGG SAGAPMVMVT ADFNGDGKAD VAVPLWNGGV AVLLGNGDGT FQEASGSPIG
LGDYTLQVGL ADFNGDGVPD LMLQQESNIT NAYALLGKGD GTFTVSSNPA PYLPCCGIAL
LMDVNGDGLT DVVNSSQYDG TASVLLTSAQ QATTQVTGIS VGGTSPHNVV AKYPGDTKYL
ASISAPTELQ PPAAAPVFTP ASGSIRPYMD SIKLTSSTPG ATIYYLAVGA IDTGGSYVTY
NGPIYGYNMG SATIHAYALA PPNYGQSATV TATFNVVGIP AAMTSPVPGS PLTGSSATFT
WDTGIGGSQY SLYLGSTPGA HDIAYISAGT NTTATVTGLP TNGELLYVTL YSWMGTKWQS
NAYTYVTSGK GTAGTMTSPA NGSRMTGGTQ AFSWTKGTGT DGYSLYVGKT AGSHEIAYVN
AGRATTTSVN GLPTNGEEFY VTLNSLNGKT WLQNTYHYYA SGSGTAAVMT SPANGTTLAS
STVTFSWTAG TGINEYSLYI GTKPGAHDLA FVNAGSATTK TVSGLPTNGS KVYVTLYSRN
GTKWLSNSYS YTAK
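
The two listings above are related by translation table 11, and the GC content field is derived from the gene sequence. The following is a minimal sketch of how the blocked listing could be parsed, its GC fraction computed, and its translation checked. It assumes that whitespace in the listing is purely cosmetic and that the sequence contains only A, C, G and T; the helper names are hypothetical. The amino acid assignments used are those of the standard genetic code, which table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG).

```python
# Codons enumerated in the conventional T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    s = clean(seq)
    if not s:
        return 0.0
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]  # KeyError on ambiguous bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGCTTAAGT ACCTGGTCTC GAAAGCCTAC"))  # MLKYLVSKAY
```

The printed residues match the first block of the protein sequence. Passing the full gene listing to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.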