Gene Acid345_1882 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1882 
Symbol 
ID4073041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2260349 
End bp2261500 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content58% 
IMG OID637983891 
ProductYVTN beta-propeller repeat-containing protein 
Protein accessionYP_590957 
Protein GI94968909 
COG category[S] Function unknown 
COG ID[COG3391] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02276] 40-residue YVTN family beta-propeller repeat 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0454626 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCAGG TGCCCCGCTT CAGATTCCAA GCTGCGCTCG GCGCGGCGAT GTTGCTGTGC 
CTGGCCGGCT GTCGACCTGA AAGCAATGTC TCGCCCAATG ATCACGAGTT CGCCTACGTG
AGCAACGGTA AGAGCGGCAC CGTCAGCGTG ATAGACGCCC GCTATTTTCG TCCGGTCACT
ACGCTCATCG TTGGCAAAGA GCCAACCGGC ATCGCGATCA ACCCGGTAAA TAACGAAGTG
TATGTGGTGA ATGCCGGGTC CGATTCGGTG TCGGTGATTG ACTCCGCGAA GAATGAGGTT
CTGGCAACGA TCGGGGTTCA CAAGGCGCCG TATTTCATCT CCGTCAGCGC CGATGGGAAA
CGTGGCTACG TGGCGAATTC CGGGTCCGGC AATTTGTCGG TGCTCGACCT GGAGAAACAT
TCGGAGATCG CGACCGTCGC TGTGGGCTCA GCGCCAGGTG AGGCTGCGGT TACGCCGGAC
GGGAAGACGG TGGTGGTCTC GAACCGTACA GCCAATTCGG TTTCGCTCAT TGACGCCGCA
AAAAGCAGCG TGCGGACGAC GCTTTCGGTT TGCAAGGAGC CACAGGACAT CGTGGTGATG
AAGAATTCGG CGAAGGCGTT TGTGGTCTGT CCGCCGACGA ACCAGGTAGC GGCTATCGAT
CTGAAGCGCG ACGTGGTGCT CGCGCTATTG GACGTTGGGG GGTCGCCGAT TCAGCTCACG
ATAAAGCCGG ATGGCGGCGA AATCTTCGTC AGCAACTTTG GCTCGAACAA TATTTCCTCG
ATCGAGGCGT ACACCAACGA AGTCGGCAAC TCGTTTCCTA TCGGCGACAA ACCCACGATG
GCCGTGACCA CCAGCGACAA TTCCCTGCTG TACGTGAGCG ACTTCGGCTC CGATTCGGTG
TCGGTGTACA GCATTGACGA CGGCAAGGCG ATTGATTCGA TCCAGGTCGG CAGCCGTCCG
GATGCGTTAG CTTTTACCAG CGACGAGTCG CACCTGCTGG TGGTCAACAG CGGCTCGGGC
GACGTGGCTG TGGTGCGCAC CGTAAAGACC AAGCAGAACT CGAAGATCAG CCGCGAACGC
TCGCTCGTGA CGATGATCCC GGTTGGCGCA CAGCCCAATG CAATCGCCAT TAAGAGCGTA
ACAACGAAAT AG
 
Protein sequence
MMQVPRFRFQ AALGAAMLLC LAGCRPESNV SPNDHEFAYV SNGKSGTVSV IDARYFRPVT 
TLIVGKEPTG IAINPVNNEV YVVNAGSDSV SVIDSAKNEV LATIGVHKAP YFISVSADGK
RGYVANSGSG NLSVLDLEKH SEIATVAVGS APGEAAVTPD GKTVVVSNRT ANSVSLIDAA
KSSVRTTLSV CKEPQDIVVM KNSAKAFVVC PPTNQVAAID LKRDVVLALL DVGGSPIQLT
IKPDGGEIFV SNFGSNNISS IEAYTNEVGN SFPIGDKPTM AVTTSDNSLL YVSDFGSDSV
SVYSIDDGKA IDSIQVGSRP DALAFTSDES HLLVVNSGSG DVAVVRTVKT KQNSKISRER
SLVTMIPVGA QPNAIAIKSV TTK