Gene Acid345_4447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4447 
Symbol 
ID4070930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5279323 
End bp5280687 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content58% 
IMG OID637986486 
Producthypothetical protein 
Protein accessionYP_593521 
Protein GI94971473 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGAACC GCCGATGTTC GGGGGCCATG ATTCTATTTG TCAGCCTGTT GCTGCCCACA 
ATCGCTCGAG GGCAGGAGCA GCCGAGTACC TCTCAACCAT CGGCTGAGGC GCTCGCTAAA
CATCTTGAGG AGATGGAACG TGAGATCAAG GAATTGCGGG CCCAGGTGAA AGTGCTAACC
GGTGAAAAAG CGGCGGCAGA GACCGCGCCA GCCGCGTCAG GGGCGTCGTC TGCTGTTGTG
CAGAACTCGC TGGTAAGCGG CACTCAACCT GCCGCCGCTC CTTCGGCAAC CCCATCGCTG
GCTTCGATTC TCGGGCCAAC CACGCTGAGC GGATTCGTGG ACGTGTATTA CGGCCAGAAC
TTCAACAATC CCGAAAGCCA GAACAACGGC TTGCGCTATT TCGATCAGGG CGCAAACCAA
TTCGGTTTGA ACTTGATGGA GTTGGTGATC GACAAGACGC CGGATCCCTC GAACAGCCGT
ACCGGCTACC ACGTTGCCCT CGGCTATGGC CAGGCGATGA ACGCGGTCAA TGCCTCCGAA
CCCAAAGCCG GGCTGGGCTT CGATCAGTAC CTGAAGGAAG CCTACTTCTC GTATCTCGCG
CCTGTCGGAA AAGGGCTGCA ATTTGACGTC GGCAAGTTCG TCACGCCCGC CGGCGCCGAA
GTGATCGAAA CCAAGGACAA CTGGAACTAC TCGCGTGGCG TGCTCTTCTC GTATGCCATC
CCGTATTTCC ACTTCGGCAT GCGCACCAAG TACACCTTCA ACGACAAATA TGCGCTGACC
GGTTTCTTCA TCAACGGTTG GAACAACGTT GTGGACAACA ACACCGGCAA GACCTACGGG
GTCAACTTCG CATGGAACCC CAACAAGAAG TTTGGAATCG CCCAAACCTA CATGGCGGGT
CCGGAAGAGA ACGGCCTCAA CCACAACGTG CGCCAGTTGA GTGACACGGT CTTCACCTAC
ACGCCGACAG CGAGACTTTC GTTCATGTTG AACGGCGACT ACGGTCGTGG CGATCGCTAC
GTCACCGACA CCGAAGCGAA CACCTTTTCG CATGCGGTGC ACTGGACGGG CGTAGCAGGC
TACGCAAAGT ACGCATTGGC CCAGAACATG GCCATCGCCG GCCGATATGA GTACTACGAC
GACGCCGACG GCTACACGCT CGGAACCCTG ACAACGACCC ACGTCAACGA ATTCACTGCC
ACCTTCGAAC GGATCATCGG ACACCACATC ATCAGCCGCT TCGAGTTCCG TCGAGATATG
TCGAACCAGC CGCTGTTCTA TAAGGGCAGC AATCCGGTCA CTGACCAGAA CACGCTGACC
GCGGGCTTGG TTATGACCTT CAACAGCGGG GAGGGCGGCA AGTGA
 
Protein sequence
MRNRRCSGAM ILFVSLLLPT IARGQEQPST SQPSAEALAK HLEEMEREIK ELRAQVKVLT 
GEKAAAETAP AASGASSAVV QNSLVSGTQP AAAPSATPSL ASILGPTTLS GFVDVYYGQN
FNNPESQNNG LRYFDQGANQ FGLNLMELVI DKTPDPSNSR TGYHVALGYG QAMNAVNASE
PKAGLGFDQY LKEAYFSYLA PVGKGLQFDV GKFVTPAGAE VIETKDNWNY SRGVLFSYAI
PYFHFGMRTK YTFNDKYALT GFFINGWNNV VDNNTGKTYG VNFAWNPNKK FGIAQTYMAG
PEENGLNHNV RQLSDTVFTY TPTARLSFML NGDYGRGDRY VTDTEANTFS HAVHWTGVAG
YAKYALAQNM AIAGRYEYYD DADGYTLGTL TTTHVNEFTA TFERIIGHHI ISRFEFRRDM
SNQPLFYKGS NPVTDQNTLT AGLVMTFNSG EGGK