Gene Acid345_1402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1402 
Symbol 
ID4068743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1700125 
End bp1701594 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content60% 
IMG OID637983411 
Productflotillin 
Protein accessionYP_590478 
Protein GI94968430 
COG category[S] Function unknown 
COG ID[COG2268] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0725365 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0723742 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTAC TTTCGAAAAT TCCTGGAGAC GTGGTGGTGA TCACCGGCCT GATCGTAGTC 
GTCGTCATGT TTCTCATGAT GATGATGGCG CGCTTGTACC GCAAAGCGGG CCCGCATGAG
GCGCTCGTGG TGTACGGATT CCGCGGGACG CGCATCATCA AGGGCAAGGG CACCGTCATC
TTCCCAATGG TCGAAAACTG CTTGCAGCTT TCACTCGAAC TGATGTCGTT CGATGTGGCG
CCGCAGCAGG ACCTCTATAC CAAGCAGGGC GTCGCGGTAA CGGTCGAAGC GGTGGCGCAG
ATCAAGGTGA AGTCCGACCC GATCTCGATC CAGACGGCGT CTGAACAGTT CCTCACCAAG
ACGCCGCAAC AGCGCGAAGG CCTGATCCGC CTGGTGATGG AAGGCCACCT GCGCGGCATC
ATCGGCCAGC TCACGGTGGA AGAAATCGTG AAGCAGCCCG AGATGGTAGG CGATCGCATG
CGCGCGACTT GCGCCGACGA CATGAGCAAG ATGGGCCTTG AAGTCATCAG TTTCACCATC
AAGGAAGTTC GCGACAAGAA CCAGTACATC ACCAACATGG GCCGACCGGA TGTAGCGCGC
ATTAAGCGCG ATGCCGACAT CGCAACCGCT GAAGCCGAGC GCGATACCGC CATCAAGCAA
GCGGCCGCAC AGCGCGAAGC TGCGGTTGCC CGCGCACAAG CCGACCAGGA AAGAGTCGCT
GCCGAGACGG CTTCGCAGGC GAAGCAGGCG GAAGCGCAGC GCGATCTGGA GGTCAAGCGA
GCCGCTTACC AGGAAATGGT GAAGAAGCAG CAGGCGCAGG CCGACAAGGC TTACGAAATC
CAGACCAACG TCATGCAGCA GCAGGTGATC GCTGAAAGCG TGAAGGTGCA GCAGATCGAG
AAGCAGGAAC AGGTGAAAGT GCAGGAAGCG GAAATCCTGC GCCACGAGAA GGAACTGATC
GCCACCGTGC TGAAGGGCGC GGAAATCGAA AAGGCCCGCA TCGAGACGCT CGCCTCCGCC
GAACGCCAGC GCCTGATGAT GGAAGCCGAA GGCCGTTCGA GTTCCATTCG CGCTCAGGGC
GAAGCCGAAG CCGAGATCAT CTTCAAAAAA GGTGAAGCCG AGGCGAAGGC GATGAACGTG
AAGGCCGAGG CCTTCCAGGA GTACAACCAG GCCGCGGTCA TCGACAAACT CCTCAGCAAC
ATGCCCGAGA TCGTTCGCGC TCTGGCCACC CCGCTCAGCC AGGTGGACAA GATCACGATC
GTTTCCACCG GCAACGGTTC GTCGGCTGGG GCGCACAAGA TCACCGGCGA TATCGCGGAA
ATGGCCGCGC AGGTACCGGC GCTGTTCGAG GCACTGAGCG GCATGAAGAT GGCAGACCTG
CTGTCGAGGG TACGCACCAT TGGCGACAAG GCACCGAAGC CAGATGTGCT GCCTCCGGAC
GACGGCAGGG CAAAAGGGGC AGGGGCGTAA
 
Protein sequence
MNLLSKIPGD VVVITGLIVV VVMFLMMMMA RLYRKAGPHE ALVVYGFRGT RIIKGKGTVI 
FPMVENCLQL SLELMSFDVA PQQDLYTKQG VAVTVEAVAQ IKVKSDPISI QTASEQFLTK
TPQQREGLIR LVMEGHLRGI IGQLTVEEIV KQPEMVGDRM RATCADDMSK MGLEVISFTI
KEVRDKNQYI TNMGRPDVAR IKRDADIATA EAERDTAIKQ AAAQREAAVA RAQADQERVA
AETASQAKQA EAQRDLEVKR AAYQEMVKKQ QAQADKAYEI QTNVMQQQVI AESVKVQQIE
KQEQVKVQEA EILRHEKELI ATVLKGAEIE KARIETLASA ERQRLMMEAE GRSSSIRAQG
EAEAEIIFKK GEAEAKAMNV KAEAFQEYNQ AAVIDKLLSN MPEIVRALAT PLSQVDKITI
VSTGNGSSAG AHKITGDIAE MAAQVPALFE ALSGMKMADL LSRVRTIGDK APKPDVLPPD
DGRAKGAGA