Gene Acid345_4463 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4463 
Symbol 
ID4070946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5295212 
End bp5297500 
Gene Length2289 bp 
Protein Length762 aa 
Translation table11 
GC content64% 
IMG OID637986502 
Productputative proline-rich transmembrane protein 
Protein accessionYP_593537 
Protein GI94971489 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.364594 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGCTA TGGCCCAACC GCAGCAGGGG GACCCGCCGG GTGTGGCGGG TCGGTTGTCG 
TTTATGCAAG GGCCTGTTTC GATGCAACCT GGAGGGGTGG ACGATTGGGT CGATGCTACC
CTCAACCGTC CGTTAACGAC TTCTGACCGT CTTTGGGCAG ACCAGGGCGG ACGTGCTGAG
GTCAGCATGG GCAACGTGAA AGCCCGTATT GACCAGCAAA CCAGCATGAC GATCGTCAAT
CTCGACGATC AGATTACGCA ACTTCAACTC GACCAGGGCA CGCTGTTCGT GCACGTTCGC
CGGTTGTTCC AGGGCGAGTC GGTCGAAATC GATACGCCGA ATGTCGCGTT CGTGATGGAC
CGTGAAGGCG ACTACCGTTT CGATGTCGAT CCCAACGCCG ACACGACTTA CGTTTCCGTG
CGTCGCGGCG ATGGTGAAGG CACCGGCGAA GCTGCGGGCG TCCACGTGCG TTCGGGCGAG
CAGGCCACGT TTGCTGGGGG GCAATCGGGT CTGGACCAGA TGGCGCAACT CGGCGATCCG
GATGATTTCG ATCAGTGGAA CGCGGAGCGC GATCGTCACC AGGACAACAC GCAGTCGGCG
CGTTATGTAT CGCCGGGAAT GGTCGGTACG GACGACCTCG ACGATAACGG CGCGTGGAGC
GAAGATCCGC AGTATGGTCC GATCTGGCAT CCGCGTGTAG CGGCGGGGTG GGCCCCGTAT
CACCAGGGGC ACTGGGCTTG GATCGACCCG TGGGGATACA CCTGGGTTGA TGATGCGCCT
TGGGGCTTTG CGCCGTTCCA CTATGGCCGC TGGGTCAGCG TGGGCGGAGG ATGGGCTTGG
GCACCTGGGC GTCCGCAGCC GGTAGCTGTC GGCGTCGCTT ATGTCCGTCC GGTTTACGCT
CCGGCGCTGG TCGCTTTCAT CGGCGGCCAC AACTGGGGTG TGAGCATCGG CGTAGGCGGC
GGCGGCGGAC CGGTGGGATG GGTCCCGTTG AGCTACGGTG AGCCGTACTA TCCGAGCTAT
CACGTCAGCC CGAACTACGT TCGCAACGTG AACGTGACCA ACACGCACAT CACCAACATC
ACCAACGTCA CGAACAACTA CACCGTCATC AACAACAACA ACACGACGGT CATCAATAAC
AACAACAAAG AGAAGGTCGT GTATCGCAAC GCGTCAGTCG CGGGCGCAGT AACTGCGGTC
CCGGCGAACG CGATGGCGAG TGGGCGTTCG GTGAATCAGG TTGCGGTTAA GGTAGATCCG
AAACAGATGG AACAGGCGAG GTTCTCGGCT GGGCCATCCG TGGCGCCGAC CAAGGCAGCA
GTGTTGGGCG GCAAGGCGCC GGCGACGCAG CATACTCCGC CGGCTGCCGC GATGAACCGT
CCGGTGATGA CCAAGGCAGC ACCGCCTCCA GCGCCGGTGA AGTTTGACGC CAAGCAGCAG
TTGTTGCAGA AGTCGGATGG CCGTCCGCTA ACCCAGTCGC AGGTTGCGAC GTTGCCGAAG
GTGCAGCGTC CGGCGGCGAT GGAAGCGGGC AAGCCGCGTC CGGCAGCAGC GGTGCAGCAG
ATCAAGGCGG CAGCGGCTGC AGCGCCGAAG AATGCTCCGG CATTCCATCC TCCGGCAGCG
AAACCTGCTC CGGGCGCTGT AAACAACAAT GCGGGCAAAC CGGGCACTCC GGCCACGCCA
GCCAACGTCA ACAAGCCGGG AACCCCGGCG GGTCCGGCGA ACGCTGGAAA GCCTAACGTT
GCGACGCCGC CGAATGGGCG TCCGACGCCG GAAGCTACGC CGCGTCCGGG TGCGAACAAC
ACTCCGGCGA ACCCGGCCAA TCGTCCAACC ACGATGGGTC CGGGGGGTGA TCGCAACGTA
GCGCATCCGC CGACTTCGAA TCCGTCAACG CCTGCGAAGC CGAATGAGAC GGCAACTCCG
GCAGGACGTC CAGTGCCGCA TCCTCCGACC GCGCAGCCTG CGCGCCCGGA GACTACGACC
CCTGCGCGTC CGAACAATCC GTCGGCCGAG AGCAACACTG CGCGTCCGCC GGAGCACGCT
GTGACACCGA ACCGTCCGAC GCCGCAGCCG CAGACGGCAG TGAAGCCGCC GGCGCATGCG
GAGCCGCACA CCACGCAAAC GCCTGCGACT CGTACGCCTC CGCCGCCGCA GCACGAAACC
AACGTGCCGC GTCCGCCGGA TCACAACGCA GCGCCAGCGC AACACACCCC TCCGCCGAAG
CAGCCGGCGA AGGACAACAA TAAGGACAAC AAAGACAATA AGGACGATAA GCAACCGAAG
TTGAAGTAA
 
Protein sequence
MPAMAQPQQG DPPGVAGRLS FMQGPVSMQP GGVDDWVDAT LNRPLTTSDR LWADQGGRAE 
VSMGNVKARI DQQTSMTIVN LDDQITQLQL DQGTLFVHVR RLFQGESVEI DTPNVAFVMD
REGDYRFDVD PNADTTYVSV RRGDGEGTGE AAGVHVRSGE QATFAGGQSG LDQMAQLGDP
DDFDQWNAER DRHQDNTQSA RYVSPGMVGT DDLDDNGAWS EDPQYGPIWH PRVAAGWAPY
HQGHWAWIDP WGYTWVDDAP WGFAPFHYGR WVSVGGGWAW APGRPQPVAV GVAYVRPVYA
PALVAFIGGH NWGVSIGVGG GGGPVGWVPL SYGEPYYPSY HVSPNYVRNV NVTNTHITNI
TNVTNNYTVI NNNNTTVINN NNKEKVVYRN ASVAGAVTAV PANAMASGRS VNQVAVKVDP
KQMEQARFSA GPSVAPTKAA VLGGKAPATQ HTPPAAAMNR PVMTKAAPPP APVKFDAKQQ
LLQKSDGRPL TQSQVATLPK VQRPAAMEAG KPRPAAAVQQ IKAAAAAAPK NAPAFHPPAA
KPAPGAVNNN AGKPGTPATP ANVNKPGTPA GPANAGKPNV ATPPNGRPTP EATPRPGANN
TPANPANRPT TMGPGGDRNV AHPPTSNPST PAKPNETATP AGRPVPHPPT AQPARPETTT
PARPNNPSAE SNTARPPEHA VTPNRPTPQP QTAVKPPAHA EPHTTQTPAT RTPPPPQHET
NVPRPPDHNA APAQHTPPPK QPAKDNNKDN KDNKDDKQPK LK