Gene Acid345_2913 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2913 
Symbol 
ID4071215 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3459895 
End bp3461652 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content57% 
IMG OID637984932 
Producthypothetical protein 
Protein accessionYP_591988 
Protein GI94969940 
COG category[S] Function unknown 
COG ID[COG4805] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTTTC TTTCCACGCT CCTCGCACTC TGCATGTTCG CGGGAGCACT CATGGCGCAA 
GGCAGCGACG TCGACAACCG GCGCAAGCAA CTCAACCAGA TTCTCGACGA CTACTGGGAA
TACACCATGC GTACGAATCC TGAGTACGCA TCCATCCTCG GCGATAAGCG CTACAACGAT
AAACTGAGCG ATGCGTCGTA CGCCGCGGTC CTGCGTGACC TGGACGAAAC CAGGAAATAT
GTAGCTCGCC TCGAAGCAGT CGATACGGCG GGCTTCCCCG AGCAGGAGCA GCTCAACAAG
GAATTGTTGC TTCGCCAACT GAAGATGGGT CTCGAAGGTG CTCGCTTTAA GACTTGGGAG
ATGCCAGTCA ATCAATTCAG TGGCATCCAC ATTGATGCGC CACAGCTCGT CAACTTGCTC
TCCTTCGAAA CCGTGAAGGA TTACGAAGAC TTCATCGCCC GTCTCAATCT TCTTCCTCTG
GTTTTCGACC AGACCGAAGA CCTCATGCGC ATGGGGATGA AGAGCGGCCG CATTCCCCCG
AAGATCCTGC TCGATCAAGT TGTAAAGCAG GCCAAGGGCC TCGCCGAGAC GAAGGCGGAG
GATAGCCCCT TCGCCGGCCC GGTGAAGAAG TTCCCGGATG GCATCAGCGC TGCGGACCAA
AAGCGACTGC ACGACGCCGT CATCACCGCG ATCTCAACCA AGGTCACCCC GGCTTACGAA
GAGTTCACCA AGTTCGTCGC CGACGAATAC GCTCCCAAAG GCCGAACTGA TCCAGGCGCA
TGGTCGCTCC CCGATGGCGA AGCACTCTAT GCCTTCATGG TGAAGCAGAG CACCACCACC
GACAAAACGC CCGAAGAAAT TCACCAGCTA GGCTTGCAGC AAGTCGCGGC GGATCGCGCT
GCCATGCTCG AAATCGCCAA GAAGATGGGC TACAGCGACC TGAAGTCGTT CGAAGCCACG
ATCAAGGACA ATCCCAAGCT CCATGCTCAA TCGCGCCAGC AGATCCTCGA CATCTACCAG
AAATACGAAG ACCAGATGTG GGCGAAGCTG CCGGAGCTTT TCGGCACCCT GCCAAAAGCA
AAAGTCATCG TGATGCCGGT CGAAGAATTC CGCGAGAAAG AAGCGTCGGC GGCGCAGTAC
AACTCCGGCA CTCCTGACGG CAAGCGCCCG GGCCACATCA TGGTTAACAC CGGCGACTTC
GCCAAGCGCC TCACCATCGA CATGGAAAGC ACTGCTTATC ACGAGGGCGT TCCCGGACAT
CACCTCCAGG GCTCGATCGC ACAGGAAATT CCAACGCTGC CCAAGTTCCG GCAGCAAGCG
TATTACACCG CTTACGTCGA AGGCTGGGCG CTGTACTCCG AGCGGCTCGG CAAAGAAGTT
GGTTTTTATC AGGATCCGTA TAACGATTAC GGGCGGTTGC AGGATGATCT GTTACGCGCG
ATCCGGCTCG TGGTGGACAC CGGATTTCAC TACAAGAAAT GGACGCGGCA GCAAGTGGTG
GATTACTTCC ACAACAACTC GGCCATCGAC GAAGTCAACG TGCAGAGCGA GACCGACCGT
TACATCGCGT GGCCCGCGCA GGCATTGGGT TACAAGATGG GGCAGTTGAA ATTTATCGAG
CTGCGCGAGC GATCGAAACA GGAGTTGGGA GCGAAGTTCG ACATCCGCAA ATATCACGAC
GAGGTGATTG ATTCGGGAGC GCTGCCGCTC GATGTTCTCG AGCGCCGCGT GAATTCGTGG
ATCGCGGAAC AGAAATAA
 
Protein sequence
MRFLSTLLAL CMFAGALMAQ GSDVDNRRKQ LNQILDDYWE YTMRTNPEYA SILGDKRYND 
KLSDASYAAV LRDLDETRKY VARLEAVDTA GFPEQEQLNK ELLLRQLKMG LEGARFKTWE
MPVNQFSGIH IDAPQLVNLL SFETVKDYED FIARLNLLPL VFDQTEDLMR MGMKSGRIPP
KILLDQVVKQ AKGLAETKAE DSPFAGPVKK FPDGISAADQ KRLHDAVITA ISTKVTPAYE
EFTKFVADEY APKGRTDPGA WSLPDGEALY AFMVKQSTTT DKTPEEIHQL GLQQVAADRA
AMLEIAKKMG YSDLKSFEAT IKDNPKLHAQ SRQQILDIYQ KYEDQMWAKL PELFGTLPKA
KVIVMPVEEF REKEASAAQY NSGTPDGKRP GHIMVNTGDF AKRLTIDMES TAYHEGVPGH
HLQGSIAQEI PTLPKFRQQA YYTAYVEGWA LYSERLGKEV GFYQDPYNDY GRLQDDLLRA
IRLVVDTGFH YKKWTRQQVV DYFHNNSAID EVNVQSETDR YIAWPAQALG YKMGQLKFIE
LRERSKQELG AKFDIRKYHD EVIDSGALPL DVLERRVNSW IAEQK