Gene Acid345_2914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2914 
Symbol 
ID4070838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3461680 
End bp3463314 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content57% 
IMG OID637984933 
Producthypothetical protein 
Protein accessionYP_591989 
Protein GI94969941 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2982] Uncharacterized protein involved in outer membrane biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTCCCC ACGAGGCGTC TTTCCGTTAT ACGCTTTGGG CGGCTTTGCA GTTCCATCTA 
GTCGAATTCG CAAGCTGCTG CATCCAATAC TTTTGGGTGG GTAAAACAAA CACAATCCGG
TGGCTGGTTG CGGCAGCGCT CATCGTGGGC GGCGTATTTA CGGTCAATTT CTATCTCGAT
CGCGAGGTGC AGACGCTCGC CCGCGACATG CTCGCCGAGC ACTTCCATAG CGAAGTTGCG
CTCGACACAA TTCACGTGCG CTTGCTGCCA ACGGCGAGGG TCAGCGGCTC GGGGCTGGTG
CTGTCACAGA CCTACGGCGG CGCGAAGATA CCCTTTATCA GCGCGCAGAA GTTTTCGGCG
AACGCATCGC TCTGGGGCAT GTTGTCGCAT ACGCGGCAGC TCCATCATGT GCGCGTGGAA
GGACTGATTA TTACGGTGGC GCGCCACCCG CAAGGCGAAA AACAGCCGAC GGCGAAGCGC
AAGGTCCCGG AGCTCGATAT CGACGAAGTG GAAGCGGATG GCGCCAAGCT TGTGATTTTG
GCGAACAAAC CCGGCAAGCG AAGCCTGGTG TACGACATGC ACCATCTTGT TCTGACGCAA
GTAGGAAAGA GCAACGTGAT GCACTATCTC GCGAATCTGC GGAATGCGCT GCCGCCGGGC
GAGATCGAAG CCGAAGGGAG TTTTGGTCCC TGGAATTTCG ACGACGCTGG AGGGACGCAC
CTCGACGGCA ACTACGTTTT TAAGAAAGCC GATCTCGGTG TGTTCAAGCA GATTTCGGGA
ACGCTGTCGT CTACCGGGTC GTTCACTGGC GAGTTGGGCA GAATTGATGT GAAGGGCTCA
ACCGAGACGC CGGATTTTGC AGTGAAGTCC GGTTCGCATC CGGTGAATTT GCATACGGAT
TTCGACGCCA CCGTGGACGG AACCAACGGC GACACACAAT TGCACAGCGT GCGGGCCAAG
CTGCTCAACT CGACCTTTGT TGTGAATGGA ACAATTGTTG ATGTACCGGG GCCGACGGGG
CACATCATTT ATCTGCATGT CGTTTCCGAC GATGCAAAGG TACAGGACAT GCTGCGGGTG
GCGGTAAAGA CGCCACCGGC GATGAAGGGC GGCCTGAAGT TCGACGCGAA AGTAAAGATC
AACCCCGGAC AGGGACCGGT ACGAAACCGC ATTACCGCGG AGGGCCGCGC CTATATCGTG
AACGGATACT TTTCGAGCGA GACGGTCTCG GAGAAGATCG CGGAGTTGAG TAATCGCGCC
CAGGGAAACC CCAAAGGAGA TAAAGACGCG CAGGTACCTG CGAAGTTCGA TACGGCTTTC
CGACTGGATG CGGGCAAACT CGCAATTCGC TCGCTTAATT TCAATGTTCC GGGGGCGGAA
GCGAAGCTGC ATGGGACGTA CGTATTGGAC GATCAAACAC TCGATTTCTC AGGGACGGCG
ATGTTGCAGT CCACCGTCTC GGAGATGACG ACGGGATTCA AGTCGCTATT GTTGAAAGCC
GTGGATCCGA TGTTTAAGAC GAAGAATGCC GGCACAGTGC TGCCGATCAC GATCACGGGC
ACGCGCGACG AACCGAAGTT CAAGGTTCAG ATGAAGCGCT TAGGAGAGGC CAAGAAAGAG
GCGCAGAGCA ACTAG
 
Protein sequence
MPPHEASFRY TLWAALQFHL VEFASCCIQY FWVGKTNTIR WLVAAALIVG GVFTVNFYLD 
REVQTLARDM LAEHFHSEVA LDTIHVRLLP TARVSGSGLV LSQTYGGAKI PFISAQKFSA
NASLWGMLSH TRQLHHVRVE GLIITVARHP QGEKQPTAKR KVPELDIDEV EADGAKLVIL
ANKPGKRSLV YDMHHLVLTQ VGKSNVMHYL ANLRNALPPG EIEAEGSFGP WNFDDAGGTH
LDGNYVFKKA DLGVFKQISG TLSSTGSFTG ELGRIDVKGS TETPDFAVKS GSHPVNLHTD
FDATVDGTNG DTQLHSVRAK LLNSTFVVNG TIVDVPGPTG HIIYLHVVSD DAKVQDMLRV
AVKTPPAMKG GLKFDAKVKI NPGQGPVRNR ITAEGRAYIV NGYFSSETVS EKIAELSNRA
QGNPKGDKDA QVPAKFDTAF RLDAGKLAIR SLNFNVPGAE AKLHGTYVLD DQTLDFSGTA
MLQSTVSEMT TGFKSLLLKA VDPMFKTKNA GTVLPITITG TRDEPKFKVQ MKRLGEAKKE
AQSN