Gene Acid345_1726 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1726 
Symbol 
ID4072071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2092821 
End bp2094047 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content57% 
IMG OID637983734 
Producthypothetical protein 
Protein accessionYP_590801 
Protein GI94968753 
COG category[S] Function unknown 
COG ID[COG3503] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCC GTCCATTTTT ATCTCTCAAT CAAGCGACTT CGGCCCGATC GAAGCGCCTC 
CAATCGGTGG ACATTCTTCG CGGCGCCATC ATGATGCTGA TGGCCATCGA CCACATTCGC
GATTTCGTCC ATCGCGGCGC GATGCAGTTC TCCCCCACCG ACCTTACCCG CACCACCGCG
CCGATCTTCC TCACCCGCTG GATCACCCAC TTCTGCGCCC CGGTCTTTTT TCTTACCGCC
GGCATCGGCG CATTTCTCTG GATGTCGCGC GGCAATCACA CCAAGCGCGA ACTCTCATGG
TTCCTCCTGA CCCGCGGCCT CTGGCTCATT CTTATCGAAA ATACGATCCT GCGCGTCGTG
ATGTTCTCGC AGGTGAGCTA CCGTGGATCC GTCATCATTC TGCTTATCCT CTGGGGACTC
GGCGCATCGA TGATCGCTCT CGCTGCACTC GCGCATCTCC CAATCCGCGT TCTCGCGCCG
CTGAGTCTTC TCGTGATCGT GATCCACAAC GCCTTCGACC CGCTGACCGC CGATAAGTTC
GGCCGCTTTG CATGGCTCTG GGACATCCTC CATCAGCAAG GCCTCTTCAC GGTCGCAGGA
TTCAACTTCG TCACCGCCTA TCCGATAGTT CCGTGGATCT TCGTCATGTC CGCCGGCTTC
TGTCTCGGCA CCGTGTTCCT TTGGGATCTC GCGCGTCGTC AAAGTTTCCT GCTGCGCCTT
GGCCTGACCA TGACCGCTGC TTTCTTCGTC GTGCGTGGCA TCAACATCTA CGGCGATCCT
TCTCGCTGGA TCCATCAGTC CACCGCAACC CTCACCGTGC TTTCCTTCCT CAACGTCACT
AAATACCCGC CGTCGCTCGA ATTCTTATTG ATGACGCTCG GCCCCGCGTT CATCGTCCTT
TCGCGTCTCG AAAACATGGG CCTTTCCGAA GCCAACCCTT TCGTGGTCTT CGGACGCGTT
CCGTTCTTCT ATTACGCTAC GCATCTCTTC GTCATTCACC TCGGCAGCAT CTTGATGAAT
TTCGTCTACT ATCGCCACAC TTCATTCCTC CTGCTTCCCG CACCTTCTAT GGGCGGTGAC
CCCAAACTCT TTCCTCCCGA CTTCGGATTT CCTCTTTGGG TTGTCTACGC CTTCTGGCTC
GCGACGCTTG CCGCCCTGTA TCCAGCCTGC CTCTGGTTCT CGCGACTCAA AAAACGACGC
CGTGATTGGT GGTTGAGTTA TCTCTGA
 
Protein sequence
MSSRPFLSLN QATSARSKRL QSVDILRGAI MMLMAIDHIR DFVHRGAMQF SPTDLTRTTA 
PIFLTRWITH FCAPVFFLTA GIGAFLWMSR GNHTKRELSW FLLTRGLWLI LIENTILRVV
MFSQVSYRGS VIILLILWGL GASMIALAAL AHLPIRVLAP LSLLVIVIHN AFDPLTADKF
GRFAWLWDIL HQQGLFTVAG FNFVTAYPIV PWIFVMSAGF CLGTVFLWDL ARRQSFLLRL
GLTMTAAFFV VRGINIYGDP SRWIHQSTAT LTVLSFLNVT KYPPSLEFLL MTLGPAFIVL
SRLENMGLSE ANPFVVFGRV PFFYYATHLF VIHLGSILMN FVYYRHTSFL LLPAPSMGGD
PKLFPPDFGF PLWVVYAFWL ATLAALYPAC LWFSRLKKRR RDWWLSYL