Gene Acid345_1497 details

Gene Information

Locus tag: Acid345_1497
Symbol:
ID: 4069244
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1821348
End bp: 1823696
Gene Length: 2349 bp
Protein Length: 782 aa
Translation table: 11
GC content: 61%
IMG OID: 637983506
Product: hypothetical protein
Protein accession: YP_590573
Protein GI: 94968525
COG category:
COG ID:
TIGRFAM ID:
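
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminating stop codon. Neither convention is stated in the record, but both agree with the values it lists. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon is not counted."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1821348, 1823696)
print(length, protein_length_aa(length))
```

With the record's coordinates this gives 2349 bp and 782 aa, matching the Gene Length and Protein Length fields.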


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.863905
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAGT GGCTAAGTAT TTGTGCAGTG GTAGCGGCGA TGGCGCTGGA GTTCGGCTGT 
GGCCTGAATG GCCCCAGCGG CCCGGTTGAT AACGGCGGTG GCGGTGGGGG CGGCGGAACC
ACAGGCACAA CCATCAATGG TGTCGCGACC AAAGGCCCGC TCAACGGCGC TACCGTAACC
GTTTATGAAG TGACAGACTC CTCGGGCGCC AACGGCAGCT CGATCGGTAC GGCGACGACG
GATGCCAGCG GCAAGTTCAG TGTCACCACA AGCAAGGTGC CGAGTGGGCC GATCCGCGTC
TCGGTGAGCG GCGGCTCATT CCTGAGCGAC GTGGACGGCA AGACGTCCAT CACCAACTCA
GCGACCTTGA CCGCGCTCAT TACCGATTCC ACGAAGATCC CGAATCCGGT GAATGTGACG
GTTGCGACGT CCATGCTCGA CACCATGGCG CAGGGTTTCG CGGGCGGCAA GACGCCTGGT
GGCCAGTTGG CGAAACGCGG CGGAAACGTG ACTAAGCTTG CGGGCACTTC GTGCTCCGGC
GGCGTTACGG CCGGCATGGG CTGCGCGACG ACTTCGCTCG GTGGCTTCTA CGGCGGTATT
CCGAGCACGG GTGGCACCGG CTTTGGCGGC ACGCCAACCG TTACTTCGGC AACCGATATT
GACGCCTTCA AGATCGGCTT GCTCAGCGGC GCGGTAGAAG TTTGCGCCAA CAAGGCCTAT
CCGACGAACC CGGGTGCGTT TTTCACGGCG ATCTTCGCGG ACGCGACCGA CCTGATCTTC
GACGGCAAGA ACGCCGGCGC AGACATCTTC CTCGATCCTC CAACCAACTC CCTGAAGCTT
TCGTCCACGG CGTTGACTAC GGACTTCCTG CTCTGCTTGA ACGAGTACGT GAGCACGAGC
ACGGTGTTGT CGGTCACCGG TGGCGATCTC GACAGTGCGG TGAACTCGAT CTCCGGCGGC
GTTTCGCAGA GCCCGCTGAC GCCGAAGGCG CTTGGCCTTT CGGCCGGCAG CTCTGGTGCA
ATCGCGACGC TGGCGTTCCA GGGCAAACAG TACTTGTTCG TGGCCGCACG CAGCGCGGGT
GTGGTGGTAA TTGACATCAC TGACCCGACG AATGCATCGC CGGCAATCAA AGTATGGCCG
GGCGTAAACG GGGTCCTCGG TAACGATGTC GGTGGCGTCA TTCCGATCAT CGGCCGTGCC
GATCACGGTC AGGTTGTAGC GTTCTCATAC GGTAGCCAGC AGATGGCGCT CTTGAACGCC
GACCTGATGG TAAACGGTGA TCCGACGAAG TCCGCGGACG AGAGCGCGAT CATTGATGCG
CACTTCGCGC CGACGTTCGT CAGCACCAGC CCAGCGGACG TTAGCGGCGG CAGTGGCTTT
GTCCTCGGCG GCACACCTGA TCCTGGACGT CACGGGGTCT GGATGTCGAC GGTGGACGGG
TACAAGTTGC TGGATATTTC GTTGACCGCT CCGGACTTCG ACGCCGGCAA CAGCTACGAC
GCGTTCCCAC AACCGGGTGG AGCGAAGTAT CCGGATCCAA CGCAGGTCGC GGAAAACATG
GGCGCCGACA TCAGCCACAA TCAACTCTTC GTCGGTAACT ATTACGGCGT GCAGGTGGTT
GACCTGGCCG GCAAGGCGAG CTACTTGCTC GACGACACCA ACTGGGCGAT GCTGGCTGCG
GTTCGCAGCT CGTATACGAT TGACGGCGAT TCGGTGGATA ACGCGCTGCA GGTTGGCGTA
CTAACGTTCG AAGATACTTC GGACGTCGCC TTCCTGAACC TGGCGGGTAT CACTACCACG
CCGGGAGCAT CCGGTGCACC GGGAACGTTC TCTCCGGCGG CGGGAGGCCT GGTAGTACTC
GACACTTCGA TGGGAGGAAC AATCCCATAC CACACCTACT CCGGCTCGGC AGTGGACAGC
ACGACCCACT ACGGACTGCT GATGGCGGGC TTCTCCACAG ACATGGGAGT CTTCCAGATC
CAGGATCCGG CGTCGGTCGC GTCGGGCGGC ACCTGGGCGG GCGCATCGAA CTACTCGATG
TTCAACCTCG CTACGGGTGG CATTGGCTAC AGCGAGGCTT ATGATCCGCA CGCGGTGGGT
GCGATCTTTA ACATCGGGAA TTCCAAGGCG TATGGCTACT TGCTGGACGG CTCGAATCTT
CGCGTGTTGC AGGTTGACCT CACGGGCTTC CTCGCAGCGA CGCAGGATGC TACCGGTCAC
CAACCAGCAA CCGATCCGAC CTCTGCTGGC GGAACGATCA CTCACTTCGA TTGGACGATT
CCGACCATCA CGGGCACCAG GAAGAGCCAA CCGGTGAAGC GCGAACAGCA CATCCCGCCG
GCTCACTAA
 
Protein sequence
MKKWLSICAV VAAMALEFGC GLNGPSGPVD NGGGGGGGGT TGTTINGVAT KGPLNGATVT 
VYEVTDSSGA NGSSIGTATT DASGKFSVTT SKVPSGPIRV SVSGGSFLSD VDGKTSITNS
ATLTALITDS TKIPNPVNVT VATSMLDTMA QGFAGGKTPG GQLAKRGGNV TKLAGTSCSG
GVTAGMGCAT TSLGGFYGGI PSTGGTGFGG TPTVTSATDI DAFKIGLLSG AVEVCANKAY
PTNPGAFFTA IFADATDLIF DGKNAGADIF LDPPTNSLKL SSTALTTDFL LCLNEYVSTS
TVLSVTGGDL DSAVNSISGG VSQSPLTPKA LGLSAGSSGA IATLAFQGKQ YLFVAARSAG
VVVIDITDPT NASPAIKVWP GVNGVLGNDV GGVIPIIGRA DHGQVVAFSY GSQQMALLNA
DLMVNGDPTK SADESAIIDA HFAPTFVSTS PADVSGGSGF VLGGTPDPGR HGVWMSTVDG
YKLLDISLTA PDFDAGNSYD AFPQPGGAKY PDPTQVAENM GADISHNQLF VGNYYGVQVV
DLAGKASYLL DDTNWAMLAA VRSSYTIDGD SVDNALQVGV LTFEDTSDVA FLNLAGITTT
PGASGAPGTF SPAAGGLVVL DTSMGGTIPY HTYSGSAVDS TTHYGLLMAG FSTDMGVFQI
QDPASVASGG TWAGASNYSM FNLATGGIGY SEAYDPHAVG AIFNIGNSKA YGYLLDGSNL
RVLQVDLTGF LAATQDATGH QPATDPTSAG GTITHFDWTI PTITGTRKSQ PVKREQHIPP
AH
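
The sequence blocks above are printed in space-separated groups of ten. A minimal sketch of how one might strip that layout, compute GC content, and translate the gene into protein follows. It uses a hand-built codon table with the standard amino-acid assignments, which translation table 11 shares for non-start codons. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tooling.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 also uses for elongation codons).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above.
first_line = clean("ATGAAGAAGT GGCTAAGTAT TTGTGCAGTG GTAGCGGCGA TGGCGCTGGA GTTCGGCTGT ")
print(translate(first_line))
```

Applied to the first line of the gene sequence, this yields MKKWLSICAVVAAMALEFGC, the first twenty residues of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` is one way to compare against the GC content field reported in the record.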