Gene Acid345_3941 details


Gene Information

Locus tag: Acid345_3941
Symbol:
ID: 4071324
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4662323
End bp: 4663300
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 57%
IMG OID: 637985967
Product: TPR repeat-containing protein
Protein accession: YP_593015
Protein GI: 94970967
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
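
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4662323
end_bp = 4663300
gene_length_bp = 978
protein_length_aa = 325

# Assumption: coordinates are 1-based and inclusive, so the span is
# end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons. Assumption: the last codon is the
# stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(remainder == 0)                                # True
print(expected_protein_length == protein_length_aa)  # True
```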


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.652305
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCCCA TTTACCTAGG TGTTCTTTTC ATCGCAGCGT TTGCGATGCA ACTTGCGGCC 
CAAGCTCCGG CGAGCATCCC GAAGGTGACA GACCTCTCCG TGCGCGTGGT TTTCGAGAAC
GGGCGTTCCG CTGGACCTAA CAATCGCGTC GAGGTATTGG GCCAGTACGG AGGCAGCGTG
ACCTCTGGAT CTACGGATAC GTCGGGCCAG GTGACGTTTC CAAGGATGGA CCCGGGCAAC
TACAGATTGC GAGTGTCGGG ACCCGGAATC GTGACGACGG AAACACCTGT CATTGACCTC
ACTGATGCAG GCCCGCGCTC CAACCAAACG GTGCCGGTGA AGCCGTCCGG CCAGATGGGC
GATTCGGCGC CGGGAGCGAC TGTGGACGCG AATATCCCAG CGGATGCTAG GAAAGAATTC
GATAAGGGCG AGGACAAGTC GCAAGGGAAA GATTACAACG CTGCGCGGGA GCATTTGGAA
AAAGCAGTCA CGATCTATCC CAAGTATGCG ATGGCCTACA ACGACCTGGG TTTGGTGTAC
ATGCACTTAA ACCAGGGCCC CAAGGCGGTG GAGGCGTTCA AGACGGCGGC GCAGTTGGAT
GAACATTTGA AACAGGCGAA CCTGTTTCTC GGCCAGTTCT ATTACGAGAA CCACCAGTTC
AAGGACGCCG AGCCGTATCT GGTTCATGCC ACCAAAGACG ATCCGAAGAA CGCACAGCTG
CTGCTGGCTC TTGCGAACAG CCAATTAAGG AATGGGCAGA ACGACGAAGC ACTCGCGACC
GCGCAGAAAG TGCATGCGTT GCCTGACCAT AAGAAATTCG CTGCAGCGCA TCTGATCGCT
GCCGAGGTAT ATGCCGACAA GGGCGACAAT CAGCACGCGA AAGACGAGTA TCACGTTTTC
CTGAAAGAAG ATTCCAACTC GCCGATGGCC CCCAAGGTGA AAGAAGCCCT GGCGAAATTG
GAAGCCCCGG CGAAGTAG
 
Protein sequence
MKPIYLGVLF IAAFAMQLAA QAPASIPKVT DLSVRVVFEN GRSAGPNNRV EVLGQYGGSV 
TSGSTDTSGQ VTFPRMDPGN YRLRVSGPGI VTTETPVIDL TDAGPRSNQT VPVKPSGQMG
DSAPGATVDA NIPADARKEF DKGEDKSQGK DYNAAREHLE KAVTIYPKYA MAYNDLGLVY
MHLNQGPKAV EAFKTAAQLD EHLKQANLFL GQFYYENHQF KDAEPYLVHA TKDDPKNAQL
LLALANSQLR NGQNDEALAT AQKVHALPDH KKFAAAHLIA AEVYADKGDN QHAKDEYHVF
LKEDSNSPMA PKVKEALAKL EAPAK
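
The sequences above are laid out in space-separated blocks of ten letters. The following is a minimal sketch, not the tool that produced this record, of how the blocks could be joined, a GC percentage computed, and the gene sequence translated. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may serve as starts; since this gene begins with ATG, the sketch does not handle alternative start codons.

```python
from itertools import product

# Standard amino-acid assignments, with codons enumerated in T, C, A, G order.
# Translation table 11 uses these same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and newlines used to lay out 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty nucleotide sequence."""
    bases = clean(seq)
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


def translate(seq: str) -> str:
    """Translate codon by codon; a stop codon is rendered as '*'."""
    bases = clean(seq)
    return "".join(
        CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases) - 2, 3)
    )


# First and last lines of the gene sequence listed above.
first_line = "ATGAAGCCCA TTTACCTAGG TGTTCTTTTC ATCGCAGCGT TTGCGATGCA ACTTGCGGCC"
last_line = "GAAGCCCCGG CGAAGTAG"

print(translate(first_line))  # MKPIYLGVLFIAAFAMQLAA
print(translate(last_line))   # EAPAK*
```

The two printed fragments match the start and end of the protein sequence listed above, with `*` marking the terminal TAG stop codon. Passing the full gene sequence to `gc_percent` gives a value that can be compared against the GC content field of the record.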