Gene Acid345_2180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2180 
Symbol 
ID4071432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2602080 
End bp2603714 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content57% 
IMG OID637984196 
Producthypothetical protein 
Protein accessionYP_591255 
Protein GI94969207 
COG category 
COG ID 
TIGRFAM ID[TIGR03436] VWFA-related Acidobacterial domain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.441496 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAACTC AATTCGTCGC AAAAGTTGTC CTTCTCTCCA CCCTGTTTGT CCCACTCACC 
TTCGCCCAAA CAGCGCCTGC CTCAGACAGC GGCATCCCTA AGTTTACATC GCGGTCAGAA
TTGGTCATGG TGCCGGTGGT GGTTACCGAC AAGTCCGGCC AGCACGTCAA AGGGCTAAAG
AAGGAGGATT TCGCCGTCCT GCAAAACGGC AAATCAATGC CGATCGCCGT CTTCGATGAG
ATCAACACCC AGGACACGGC GCTTCCCCCG CACCAGACTC CCGACGGCGT CTTTTCCAAC
GCCGTCCCGA CTGACAACGT CTCGCGCCGC GTGATGGTCA TCGTTATGGA CCTCCTTAAC
ACACCTTTCG CCGACTCGGC TGACGCGCGA CGGGCCCTAT TTAATAACGC CAAGGTGCTG
ATCGACGCCC ACATCCCCGT ATCGTTAATG GTCATTAACA GTACAGGACT CCACGAGATT
TTCGGACTGC ACTCAGACCC GCGCATTCTA GAGACTGCCC TCGATCGGGT CGGCACCCCG
AACCCGACAG ACCTTCACGA ATCGGCAACA TTCGGATGGG ACCTCGGAGA CAAGTACCTC
TACAACCAGT ATCAGGAAAC CGTTGCTCAA CTTCAGAATT CGATCGGCGG CCCCGGCACC
AAACTCGGCA CCTCGAACCC GAACCCCGAG GCCGCGTATC AAAGCTACCG CACGCATCAT
GACGTCGAGA TCACCCTTTT TTCTCTCGAA CAACTGGCCC ACGCCTACTC CGGCATCCCT
GGGCGCAAGA TCATGATCTG GATCACCGGC GGCTTGCCAA TGAATATCCT GGATCCAACT
TCAATCTCCG CGTACGGCGA CATCATGCTC GACACCTATC GCCGCACCTT TCTCGTTCTC
AACGCGGCGA ACTTCTCTGT CTACCCGGTT GATGCCCACG GTCTTGGGCT CGAGTCCATG
AGCAAGCACC TGAACCTCAC ATCCACGATG AATGCGTTCG CCGACGCCAC AGGCGGCACC
GCCTTCTACA ATCGCAACGA CATAGGGGTG GGCATTCGCA ACGCCGTGCA GGACGCGGTT
CAGTACTACG AAATCGGCTA TTATTTGCCG CACGAAAGCC ACGGCAAACC GCAATGGGAG
AAGATCAAGG TGAAGGTCGA CCGCAAGGAG ACCGCGGTTC GCACCCGCGA CGGATTCTTC
TCCGGCTATT CCGAAAAGCC GGAACCGAAG AGCCTCAAGA TGGAAATGGA ACTCGCATTC
GCCTCCCCCG TCGCCTACAC CGGCTTTCCG ATCGCAATCA AGGTCTCCGC GCTGGCTTCC
GGCGCTAACC TCACGTTCGA TCTCACGGTA CCACCGCGCT CCTTCCGCAT CGATCGCGAA
AACAACAATT TTGCCAACAT CGAATTTGCC GCCGTCGCCC TCGGTCCACA CCATCAGTCC
AGCGGCATCT TCGCACGACG CATCACCGGA AACCTCACTC TCGAGAACGC GGACCACATC
GAGAGCCAGG GTTTTGCGAT GCACGACACG ATCACCCTTC CCGCGAATAC GAATCGAGTG
AAATTCGTAA TTCGCGATAA CCTCACGGGC AAAATCGGCA GCGTGGTCGC ACGCTTGAAT
ACCAACTCAA AATAA
 
Protein sequence
MRTQFVAKVV LLSTLFVPLT FAQTAPASDS GIPKFTSRSE LVMVPVVVTD KSGQHVKGLK 
KEDFAVLQNG KSMPIAVFDE INTQDTALPP HQTPDGVFSN AVPTDNVSRR VMVIVMDLLN
TPFADSADAR RALFNNAKVL IDAHIPVSLM VINSTGLHEI FGLHSDPRIL ETALDRVGTP
NPTDLHESAT FGWDLGDKYL YNQYQETVAQ LQNSIGGPGT KLGTSNPNPE AAYQSYRTHH
DVEITLFSLE QLAHAYSGIP GRKIMIWITG GLPMNILDPT SISAYGDIML DTYRRTFLVL
NAANFSVYPV DAHGLGLESM SKHLNLTSTM NAFADATGGT AFYNRNDIGV GIRNAVQDAV
QYYEIGYYLP HESHGKPQWE KIKVKVDRKE TAVRTRDGFF SGYSEKPEPK SLKMEMELAF
ASPVAYTGFP IAIKVSALAS GANLTFDLTV PPRSFRIDRE NNNFANIEFA AVALGPHHQS
SGIFARRITG NLTLENADHI ESQGFAMHDT ITLPANTNRV KFVIRDNLTG KIGSVVARLN
TNSK