Gene Acid345_1710 details


Gene Information

Locus tag: Acid345_1710
Symbol:
ID: 4072055
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2075860
End bp: 2077428
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 63%
IMG OID: 637983718
Product: hypothetical protein
Protein accession: YP_590785
Protein GI: 94968737
COG category:
COG ID:
TIGRFAM ID:
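
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of the source record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2075860
END_BP = 2077428
GENE_LENGTH_BP = 1569
PROTEIN_LENGTH_AA = 522


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1569
print(coding_length(GENE_LENGTH_BP))   # 522
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.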


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.636106
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.107463
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAACT TCTCCCGCTC TTCCCTAGCC GCGCTCCTGA TCGTCGCCGG CCTCTCGCTC 
ACCGGATGCA ACAAACAAAG CGCTCCGACG GCCAACGCAG CAGCACCGCA GCAACAACAA
GCCGCTCAGC CCGACCAGTC GCAGGCTGCT CAGTCACAAA ATCCTGAGGA CAACGGCAAT
CTCCCGCCAG TCGACGCCAA CGGCAACCCC ACCGACCAAC CCACGAGCGA CCAGCAGAGC
TATCCTGCTC AGGATCAAAG CGCTAGCCAG CAACAACCGG CGCAGAACCA GGGCAATGCC
AGTCAGCAAC AGCAGTATCC CGACCAGGGT TCTCAGCCAG CCCAGGCGCA AGCTCCCGCC
TCGCAGAGCT ATCCCGACAA CAGCAACAAT GTTGGATACG GCGACAACAA TCAGGATTAC
GACCAGGACC TGTCCCAACA AGACAGCAGC TATGGCCAGC CTGCAATTCA GGCCCATCAA
GCGCCGCCCC CAATTCCCGA GTACCAGCAG CCCATGTGTC CCAGTCCCGG CTACGTCTGG
ACTCCCGGGT ACTGGAGCTA TGCTCCTGCC GGATACTACT GGGTGCCCGG CGCCTGGGCG
CGCCCGCCGC AAGTCGGCTT CCTCTGGACG CCCGGCTATT GGGGTTTCGG CGGCGGAGTC
TATCGCTTCC ACTATGGCTA TTGGGGACAC TACGTAGGCT GGTACGGTGG CATTAACTAC
GGCTTTGGAT ACGTCGGATC CGGCTACCAC GGTGGTTACT GGCACGGCAA CAACTTCTAC
TACAACCGTT CCGTCAACAA CGTGAATGTC ACCAATATCA CCAACGTCTA CAACAAGACG
GTCATCGTCA ATAACAACAA CCGCGTCAGC TACAACGGCC CCGGCGGCAT CACCCGCCGT
CCCACGCGCG CAGAAGCAGT TGCTGTCCGT CAGCAGCGTA TCCCGCCGAT GACCACGCAG
ATCGAGAACC AGCACAATGC CATGCGCGAT CGCCAACAGT TCGCGTCCGT CAACAAAGGA
CGCCCCGCGA TTGCCGCTGC TCCCAGACCG ATCGAGGCGG CAAAGCCGGT CGCGCCCGCG
ATCGCCGCAC GCCCGGTTCC GCGACCAGCT GCCGGCGCCA GGCCGAACCA ACCAAACAAC
GTCGCACGTC CAACTCCGCA GCCAAGTACG CGTCCCACGC CCGTTTCTCC TGCTCGTCCC
GAAGCGCGTC CTGTTCCGCG GCCCACCACC ACTCAACCGA GCGTGAAACC AACACCTCAG
CCGAGCACGA GGCCCACTCC GCAACCTTCC ACCCGTCCAA CGCCGCAACC GAACACGCAC
CCGGTTCCGC AACCAAAGCC CGCGACGCGA CCGACGCCGC AACCTTCCAC GAGGCCGACG
CCTCAGCCGA ACACGCGGCC TACTCCGCAA CCAAAGCCGC CGACACATCA GGCACAGCCC
AGCACACGCC CTGCGCCACA GCCCCACCCC GGGACGCAAC CGCCAGCCAA GCCTGCAACG
CGGCAGGCGC CACAACAACA GAGCAGGCCT TCCAAAGACT CGAAGCCAGA TCGGCCCGAA
CACCGATAG
 
Protein sequence
MLNFSRSSLA ALLIVAGLSL TGCNKQSAPT ANAAAPQQQQ AAQPDQSQAA QSQNPEDNGN 
LPPVDANGNP TDQPTSDQQS YPAQDQSASQ QQPAQNQGNA SQQQQYPDQG SQPAQAQAPA
SQSYPDNSNN VGYGDNNQDY DQDLSQQDSS YGQPAIQAHQ APPPIPEYQQ PMCPSPGYVW
TPGYWSYAPA GYYWVPGAWA RPPQVGFLWT PGYWGFGGGV YRFHYGYWGH YVGWYGGINY
GFGYVGSGYH GGYWHGNNFY YNRSVNNVNV TNITNVYNKT VIVNNNNRVS YNGPGGITRR
PTRAEAVAVR QQRIPPMTTQ IENQHNAMRD RQQFASVNKG RPAIAAAPRP IEAAKPVAPA
IAARPVPRPA AGARPNQPNN VARPTPQPST RPTPVSPARP EARPVPRPTT TQPSVKPTPQ
PSTRPTPQPS TRPTPQPNTH PVPQPKPATR PTPQPSTRPT PQPNTRPTPQ PKPPTHQAQP
STRPAPQPHP GTQPPAKPAT RQAPQQQSRP SKDSKPDRPE HR