Gene Acid345_3234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3234 
Symbol 
ID4072569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3830390 
End bp3831670 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content60% 
IMG OID637985255 
Producttype IV pilus assembly PilZ 
Protein accessionYP_592309 
Protein GI94970261 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0626406 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACTAG CATCTGAGTC AACATTTCCC CCAAATCTCC TTCACGTCGG GGTTCATACC 
CCGAGCTTGG AAAAAATCAT GGCTCGCAGG CGGGAAGAAC GAATCATCAT CAGCTTGCCC
GTGCGTTTAT GGGGCATGGA CGTAAATGGG AAACCGTTTA CGCAAAATGC CTCGTCCGTT
GACATCACCC GAATGGGCGC CCGCATTCAG GGTGTCACGG CACAGATTCA GCATGGGGAC
ATCATCGGCG TCCAGCACGG CAGCGACAAA GCTCGCTTCC GCGTTATCTG GGTCGCTCGG
CCGGGATCGC GTAACGAAGG CCAGATCGGC GTGCATTGCG TCGAGGCCAA CAAGTACATC
TGGGGGACCG TCGAGCCGAG CAATCAGGCC GATACCTGGG ATCCCGAAGC GGCTTCGGCC
TCAACCCACG CGGTGGGCGC CGGCGGCGGA GTCGCCGCCG TCATGGCTGC CTCGCCATCG
CACAAAGACA ACTTCGTGAA CGACGCCACC CGCGAGGGCC GGCGCCGCAT GCCACGCTAC
GCGTGCCGCG GTGGTGGTGA AATTCGCCAG CCCGGAATGA AGACCGTGGT GTGGGGCTCG
ATGACCGACA TCAGCCGCAG CGGATGCTAT CTCGAAACGC TGACCACGCT GCCGCGCAAT
GCGAAGTGCG AACTCATGCT GAACGTAGAA GGCATTGAGG TACGCGCCGG CGCGGAAGTC
CGCGTCTCGC ATCCTTCGAT GGGCATGGGC TTGCAGTTCA TTGACGTCGA TCCGACCGAT
CAGAAGAAGC TCGATGATCT GCTCGTGAAA CTTGCGGGTG GCAAAGAGCC GGAAGATCGC
ATCGTGCATC CGGTGAGCAA TGAGTTCGCG ACCGCGATCG CCTCAGCGGC GTCGCAGCTT
CGTGACCTCG AAGCCTGCGT ATCGGAAAAC GAGGAAAACG TCGATCCACG GCTGCTCTCC
GAGTTCCGCA GCGCCGTCGA TCATGCGCGT TCCACCACTG CAGCAATTGA GCAGTGGGTC
GATTTGCAGG AGCAGGACCG CGATCCATTC CCGGTTCTCG CGGCGATTGA AACGAGCAGG
ATCCGTTTGA CTGCGAGCTT CATGCGCGAG CTGGTGATGG ACATTGACGC CGCAACTTTG
CATCTCGGCA GCGAAGGTGT GAAGGAGTTG TACGAGGCGG CACGTCAGTT GCACCTGCGC
ATTGAGCAGA TGATTGCAGA TGCGACCGAG CCGGAAGATC TGCTCGACGC AGACGACGAC
CACGCACAAT CAGCCGACTA G
 
Protein sequence
MRLASESTFP PNLLHVGVHT PSLEKIMARR REERIIISLP VRLWGMDVNG KPFTQNASSV 
DITRMGARIQ GVTAQIQHGD IIGVQHGSDK ARFRVIWVAR PGSRNEGQIG VHCVEANKYI
WGTVEPSNQA DTWDPEAASA STHAVGAGGG VAAVMAASPS HKDNFVNDAT REGRRRMPRY
ACRGGGEIRQ PGMKTVVWGS MTDISRSGCY LETLTTLPRN AKCELMLNVE GIEVRAGAEV
RVSHPSMGMG LQFIDVDPTD QKKLDDLLVK LAGGKEPEDR IVHPVSNEFA TAIASAASQL
RDLEACVSEN EENVDPRLLS EFRSAVDHAR STTAAIEQWV DLQEQDRDPF PVLAAIETSR
IRLTASFMRE LVMDIDAATL HLGSEGVKEL YEAARQLHLR IEQMIADATE PEDLLDADDD
HAQSAD