Gene Acid345_3822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3822 
Symbol 
ID4071106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4518100 
End bp4519287 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content56% 
IMG OID637985845 
Producthypothetical protein 
Protein accessionYP_592896 
Protein GI94970848 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.333872 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.578419 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGCG ATTTCGAAAA CCAATTCCTC GCAAAGGTAG TACCTGGCCC AGCCGAACGT 
CTTGTGTTGC TTGCGTGCTC AAGCGATTCG GGGACCAACA CTGAGATCAA AGCGCTTTAC
GAACAGGTCG GAGACGAGGT CGCGTGGCAA GTCGCAAAAC AGCACGAGCT TGAAGGAAAT
CTCGGGCACC GGTTGATCGA CATTTTGGGT GAGCAAGTTC CACTCCGTTG GCGCGCAGCG
CATGAGACGG TGGGGAACAG GATTGGTGCG TATCTGAACG AAGTGGACCG AATGGCGGCT
CGGTTAGCAC AACAGGACAT TCCTCTGGTT GCGCTAAAGA ATGCTGGTAT CGCCCGCGGT
GTGTATCAGT GCGCGGGGTG CTCGCCAATG GGCGATGTCG ATCTCCTGGT TCGGCGCGCC
GACTATCGGC GTGTCCACGC AATTCTGTTA GAAGAAGGGT TTACGTGTGA CTCGCGGAAT
GTCACTGAGG AAGGGACCTT GGAAGAGGGC GAGGTAACGG GCGGAACCGA ATATCACAAG
GAGATTCCGG ACGTTGGTAC GTTCTGGTTA GAACTGCAGT GGCGGCCAGT TTCGGGTCGA
TGGTTACGTC CGGATCAGGA GCCAAATGGC GACGAACTCG TTAGTCGGTC TGTTCCGATT
GAAGGCACCC ATTTGCGGCT GTTGAACCCT GAGGACAACC TCTTACAGGT GTGTCTCCAT
ACGGCGAAGC ACACATACCT GAGGGCACCA GGGTTGCGCT TGCACACCGA CGTAGAGCGC
ATCGTGAGGC AGTTGCAAAT CGATTGGGAG GCCTTCCTCG CAAAGGCGAA GGCCCTTCAG
GTGCGGACTT CGACTTACTT TTCACTTTGG CTGCCGGCGC GGCTTCTCAA CACTCCAGTG
CCTGACGCTG TGTTGTCGGA ACTCGCGCCC TCTCGGCGGA AGCGCAAGGC GATACTCAAG
CGTTTGCAAA GAGCGGGACT GTTTTATCCG GCCCGACCGA AGTTTTCCAA CATCGCGTAC
ATTCGGTTCA ATAGCCTTTT GTACGACAGT TCGAACGGAT TGGTCCGGGC GATTTTCCCC
GACACAGAGT GGATGAAGAA GCGGTATGGT TTCCGGAGCG CCCTTTTACT CCCCTATTAC
CACGTGCGAC GCATCGCGGA TTTGGGGCTG CGTCGAGTTG GGATTTGA
 
Protein sequence
MTSDFENQFL AKVVPGPAER LVLLACSSDS GTNTEIKALY EQVGDEVAWQ VAKQHELEGN 
LGHRLIDILG EQVPLRWRAA HETVGNRIGA YLNEVDRMAA RLAQQDIPLV ALKNAGIARG
VYQCAGCSPM GDVDLLVRRA DYRRVHAILL EEGFTCDSRN VTEEGTLEEG EVTGGTEYHK
EIPDVGTFWL ELQWRPVSGR WLRPDQEPNG DELVSRSVPI EGTHLRLLNP EDNLLQVCLH
TAKHTYLRAP GLRLHTDVER IVRQLQIDWE AFLAKAKALQ VRTSTYFSLW LPARLLNTPV
PDAVLSELAP SRRKRKAILK RLQRAGLFYP ARPKFSNIAY IRFNSLLYDS SNGLVRAIFP
DTEWMKKRYG FRSALLLPYY HVRRIADLGL RRVGI