Gene Acid345_4387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4387 
Symbol 
ID4073293 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5200767 
End bp5202275 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content59% 
IMG OID637986420 
Producthypothetical protein 
Protein accessionYP_593461 
Protein GI94971413 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTT CAATCCTCTT CGCGTTCCTT GCGTGCGTCT GCGCAGCATA TGGCCAGCCG 
AACACATCTT TCTACGTTGC GACGACCGGG AAGGATTCAA ATGCCGGCAC GCAAGCAGCG
CCGTGGCGCA CCATTCAACA TGCCGCAGAC ACGGCGCGCG CGGGCAGTAC CGTCAACGTG
CGCGGCGGAA CCTACGAAGA GCTGGTGAGC CTCCACGCAT CCGGCAACGC CAGCGATGGT
TTCATCACGT TTCGAAGTTA TCCCGGCGAG GCGGCGATCC TCGAGGCCGA GCACATCACG
CCGCCCGGAA GGACCGGCGT ACTGACGATT CACGACCAGA GTTATGTGCG GGTCGAAGGC
TTTGAGATTC GCAACTTCCG CACCGCCGAG CATGACCTCG CTCCGCTAGG CATCGACGTA
ATGGGCGCGG GCTCTCACAT CGAGCTGCTG AAGAACAACG TCCACCACAT TCAGCAGACA
TTTGAAGGGC GCGATGGTCC GGGGCACGGC GCCAACGCGT TTGGCATTGC GGTGTACGGA
ACCAGCGCCA AAACTCCGAT CACGGATTTG GTCATCGATG GCAACGAGGT GCATCACCTC
AAGACCGGTT CGAGCGAGTC GGTGGTGGTG AACGGGAACG TCACCAACTT CCGCATCACG
CACAACGTTG TGCACGACAA CAACAACATT GGCATCGACG TCATCGGTTT CGAGCATACC
GCGCCCGACC CGGCGGTGGA CCAGGCGCGC GACGGGCTCG TCAGTGGCAA CTTGGTTTAC
AACATCACCT CGAAGGGCAA TCCCGCTTAC CGTAACGATG AATCTTCCGA CGGCATTTAC
GTGGACGGCG GCACCCGGAT CCTTATCGAA CACAACGTAG TTCACGATGT GGACTTCGGC
ATCGAGCTGG CGAGTGAGCA CAAGGACCGC GCCACCAGCT ACGTCATCGC GCGCAACAAT
CTCGTCTATC ACAACCACAC CGCCGGTGTT TCCATCGGCG GCTACGATCC GCAGCGCGGA
CACACCGAGC ACTGCACGGT GATCAACAAC ACGCTCTACG ACGACGACAC CTCGGCCACC
GGCTCCGGTG AGTTCCAGAT GCAATGGAAC ATGGCAGACA ATATTTTCGC GAATAACATC
GTGTACGCCG GGCCGCAGTG CCTGATGACG ATTCTCAAAA CTGAAGTCAA GCCCGGCCAA
CCGCCCGCGA ATATCGATCA CAACCTCTAT TACTGCGCTT CCGGTGCCAA GGCGAGCACG
TGGAAAAACA CTGCCGCCAC TGTGACGGGA TTTGAAGAGT ACTCGCAGGC CAGCGGCAAT
GACCGCAATT CGCATTTTCA GGATCCCCAT TTTGTCGACG CTGCCGCGAA GGACTTCCAC
CTGCAGCCAG ACTCTAAGGC CATCGCCGCA GGAGCCATTG ACGGAATGCC GGTGGGAGCA
CTGGATCTTG ACGGCTCGCC GCGGACGAAA TCTGGCAACA TCGACATCGG CTGCTACCAA
CGAAAATAG
 
Protein sequence
MKISILFAFL ACVCAAYGQP NTSFYVATTG KDSNAGTQAA PWRTIQHAAD TARAGSTVNV 
RGGTYEELVS LHASGNASDG FITFRSYPGE AAILEAEHIT PPGRTGVLTI HDQSYVRVEG
FEIRNFRTAE HDLAPLGIDV MGAGSHIELL KNNVHHIQQT FEGRDGPGHG ANAFGIAVYG
TSAKTPITDL VIDGNEVHHL KTGSSESVVV NGNVTNFRIT HNVVHDNNNI GIDVIGFEHT
APDPAVDQAR DGLVSGNLVY NITSKGNPAY RNDESSDGIY VDGGTRILIE HNVVHDVDFG
IELASEHKDR ATSYVIARNN LVYHNHTAGV SIGGYDPQRG HTEHCTVINN TLYDDDTSAT
GSGEFQMQWN MADNIFANNI VYAGPQCLMT ILKTEVKPGQ PPANIDHNLY YCASGAKAST
WKNTAATVTG FEEYSQASGN DRNSHFQDPH FVDAAAKDFH LQPDSKAIAA GAIDGMPVGA
LDLDGSPRTK SGNIDIGCYQ RK