Gene Acid345_4380 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4380 
Symbol 
ID4071798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5191514 
End bp5194216 
Gene Length2703 bp 
Protein Length900 aa 
Translation table11 
GC content62% 
IMG OID637986413 
ProductTPR repeat-containing protein 
Protein accessionYP_593454 
Protein GI94971406 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAGGTT TTGGTTTCAA CAAGAACAAG GTATTGGCGA ACGCCGAGCG TTACGTGCAG 
CAAGGCAAGC TGCAGAACGC GATCACTGAG TACGAGAAGA TCACCAAGGC GGACCCCAAG
GACCTGACGG TGCTCAACAC CATCGGCGAC CTCTACGCTC GCGTGGGGCA CGTGGAACAG
GCCACCGACT ACTTCAGGCG AGTGGGCGAT CGCTATTCCT CCGACGGCTT CACCGTAAAA
GCCATCGCCA TGTACAAGAA GCTGACGAAG CTGAATCCGC AGGCCTATGA CTGCGTGCAG
CGCCTCGCCG AGCTTTATAC CCAGCAGGGC CTTTACTCCG ACGCGCGCCA GCAGTACCTC
GTTGTTGCCG ACCAATTCAT GCGCAGCAAC CAGCTCCAGG ATGCCGCCCG CATCTTCCAG
CGTGTTCTCG AACTCGATCC CGAGAACACC ACGATGCAGA ACAAGCTGGC TGACCTCTAC
GAGAAGGTCG GAAATAAAGA TGAAGCGCGC AATATGTTCT TCCGCAGCGC CGACTCGCTC
TATGCCCGCG GCTCGCTGCC ACAGGCCGAT GAAGCGCTCG GACGCGTTCT CGCCCTCGAT
CCCCACAACT CGCGCGCGTT GATGCTGCGC GGCCAGATCT CGATGGAGGG CGGCGACGCC
GCCGGGGCCA TCACCTGGCT CGAGCAAGTC GCCGACCTCG AAACCAATCC CGACGGACTG
AAAGAACTCG TCCGCGCTTA TTTGAAAGCG AATCGTCTCG CCGACGCCGA GCGCCAGGCG
CGCACCCTGC TCATCCACTA CAAGGACATC TGGGGCATCG CGGTAACCGT CGATGCGCTG
ATGCAAAACG GCAAGTTCCT GCCGGCGCTC AAGCTGCTCG ACGAATTCGC CGATCGTCTC
GTCGCTGCCA ACGACAGTGG TACCGCCGAG ATGCTGCACA ACTGCATTGG CCGCGTGAAG
GACGATGCCA CCGCGCTCGA CCTGCTGCGC AAGCTCTTCA TCAAGCTCGG CCAGAAGGCG
TATCTCAACG AGATCAGCGA ACTGCTGGCG CATGCGTTGG TGCAGAAGGG CGACCTCAAG
CGCGCCCGCG ACATCTATCG CGAACTGGCA GAGCTGGAGC CCGAGAACCC GGTCCATGCG
CAGAACTATC GCCAGCTCTC GGCGAAGCTG GGCGACGATC CGCTCTCCAA GCCACTGGCG
AAGGAACAAG CCAATGCGCC GATGATGGTG GAAGAGCTCG ACAGCACCTC CGCCGAGATC
GAACAGGAAT GGGCGCCCGA CGTCGAGGCC GCCGTGCGCG CAGCGCTCAC CGACAGCGAG
CTCTTCGATT CCTACAACCT GCCGGCGAAA TCCATTGCGC CGCTCGAAGC GGTGCTGCCG
AAAGCGCCGG GCGATGTGCG CCTCAACCAG CGCCTCGCGG GCTTATATCG TAAGGCGAAC
CGTCCTAAGG ACGCCTCGCG CTGCTGCGTG ATCCTTCAGA AGATGTTCGA GCACTACGGA
CATCCCGACC GCGCCAAGGA ATTTGCCTCG CTCGCGTTGA AGTACGCGGG ACAGGCCGGC
ATCCCGATTC CGTTCGTAGA CATCACGCCG TGGCTGCCGG AGCCCAAAGC CGCTGCGCCT
GCCCCGGCCG CGCCTGCCAC CGTCGAACAC ACCTTCGAGA TCGAAGTGGA AACGCCTGCT
GCAGACGCCA ACCTGCCGGA GTTCGCAGTT GCTGCGTCCA CCAATCAGGA GATCGATATC
TCCGATGACT GGGAGACGCA CACCGACGAG CCCGTGAAGA CCGTCGAAGG CGCCCCGCAG
GAACTCGTCG AAGAGATCCA GTTCTATCTT GGCCAGGGCA TGGTGGATGA GGCCAAGGTC
GCAATTAAGA AGCTTGAAGC GATTGCTCCC GCGTTCTCCA AGCTGCGCGA GTGGAAGGCG
AAACTGGCCT CTCCTGCCGC AGTACAGGCA GACGAAGATC ACGTGCTTTC GCTCGACGAC
ACGCAGACGG TTGCCGCGAA GCCTCCCGCC GAACAAGGCA TGGGCGGTTT CGTCAGCGAC
CTTGAAGACG CGCTCGGCGA CGACTTCAAC GTGGCCGGTG CACCGGCGAA ACCTGCTGCA
GCGGCTCCGT CGAAGCCTGC GGCTGCAACA TCGCGGCCCA CTCCACCTCC GCCGGCTCCG
CCGCCTGCGG TGCGTCCGCA GGTTGCCGCG GCTGCCTCGG CGCCTGCGCC CGTCGCCGAA
GAACTTCCGG CCGTGGACCT GTTTGCCGAT CCCGGCGCCA GCGACATGCT GGGCGACCTC
TTCGCGGAAT TCAAAGAAGA TGTGGAAGAG GGCGCCGCCG ATTACGGCGA TCCCGATACC
CACTACAACC TCGGCATGGC CTTCAAGGAA ATGGGCCTGA TGGACGAAGC CATCGGCGAA
CTGCAAAAAG TTTGCCAGGC GGTTGACCAT GGCGTGCCGT TCTCCCAGGC GTTCCAGGCA
TACACCTGGC TGGCGCATTG CCTCATCGAA AAGGGCGTTC CTGATGCCGC GTATCGCTGG
TACGAGAAGG CGCTGACCAT CGCGCCGGAC CAGGAAACTC GCACCGCGAT CCACTACGAA
CTCGGTTCCG CCTACGAAGC CGCAGGCAAT AGGCCGCAGG CCCTCCAGCA TTTCATGGAG
GTCTACGGCG TAAATATCGA CTACCGGGAC GTGGCAGAGC GCATCAAAGG TGTCAGGTCG
TAG
 
Protein sequence
MLGFGFNKNK VLANAERYVQ QGKLQNAITE YEKITKADPK DLTVLNTIGD LYARVGHVEQ 
ATDYFRRVGD RYSSDGFTVK AIAMYKKLTK LNPQAYDCVQ RLAELYTQQG LYSDARQQYL
VVADQFMRSN QLQDAARIFQ RVLELDPENT TMQNKLADLY EKVGNKDEAR NMFFRSADSL
YARGSLPQAD EALGRVLALD PHNSRALMLR GQISMEGGDA AGAITWLEQV ADLETNPDGL
KELVRAYLKA NRLADAERQA RTLLIHYKDI WGIAVTVDAL MQNGKFLPAL KLLDEFADRL
VAANDSGTAE MLHNCIGRVK DDATALDLLR KLFIKLGQKA YLNEISELLA HALVQKGDLK
RARDIYRELA ELEPENPVHA QNYRQLSAKL GDDPLSKPLA KEQANAPMMV EELDSTSAEI
EQEWAPDVEA AVRAALTDSE LFDSYNLPAK SIAPLEAVLP KAPGDVRLNQ RLAGLYRKAN
RPKDASRCCV ILQKMFEHYG HPDRAKEFAS LALKYAGQAG IPIPFVDITP WLPEPKAAAP
APAAPATVEH TFEIEVETPA ADANLPEFAV AASTNQEIDI SDDWETHTDE PVKTVEGAPQ
ELVEEIQFYL GQGMVDEAKV AIKKLEAIAP AFSKLREWKA KLASPAAVQA DEDHVLSLDD
TQTVAAKPPA EQGMGGFVSD LEDALGDDFN VAGAPAKPAA AAPSKPAAAT SRPTPPPPAP
PPAVRPQVAA AASAPAPVAE ELPAVDLFAD PGASDMLGDL FAEFKEDVEE GAADYGDPDT
HYNLGMAFKE MGLMDEAIGE LQKVCQAVDH GVPFSQAFQA YTWLAHCLIE KGVPDAAYRW
YEKALTIAPD QETRTAIHYE LGSAYEAAGN RPQALQHFME VYGVNIDYRD VAERIKGVRS